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Introduction 


On September 14, 1959, twelve days after passing through her point of closest approach 
to the Earth, the planet Venus was bombarded by pulses of radio waves sent from Earth. 
A handful of anxious scientists at Lincoln Laboratories in Massachusetts waited to detect 
the echo of the reflected waves. To their initial disappointment, neither the data from this 
day, nor from any of the days during that month-long observation, showed any detectable 
echo near inferior conjunction of Venus. However, a later, improved reanalysis of the data 
showed a bona fide echo in the data from one day: September 14. Thus occurred the first 
recorded radar echo from a planet. 

Exactly fifty-six years later, on September 14, 2015, a rather different signal was 
received by scientists, this time in Hanford, Washington, and Livingston, Louisiana. The 
signal was not electromagnetic but instead was a wave in the fabric of spacetime itself. 
It was the final burst of gravitational waves from two black holes that merged to form a 
single black hole somewhere in the southern sky some 1.3 billion years ago. The signal was 
recognized within minutes by automated data processing software. This time, the scientists, 
numbering over 1,000, were anxious lest the signal be an unlucky artifact of instrumental 
noise. But on February 11, 2016, after an intensive and secretive five months of detailed 
analysis, checks and cross-checks, they announced at a Washington, DC, press conference 
that the Laser Interferometer Gravitational-Wave Observatory (LIGO) had made the first 
direct detection of gravitational waves. 

As if the 100th anniversary of the general theory of relativity during 2015 was not 
already something to celebrate, the detection of gravitational waves was icing on the 
cake.! It was also the capstone of a half-century period during which general relativity 
experienced a remarkable renaissance, from a subject relegated to the backwaters of 
physics and astronomy, to one that is regarded as central to the major scientific questions 
of the day, from the nature of the fundamental particles to the fate of the universe. It was 
a perfect illustration of how the field was transformed from one that was once called “a 
theorist’s paradise and an experimentalist’s purgatory,” to one in which experimentalists 
and theorists work hand in hand. It highlighted the field’s evolution from the world of 
“small science,” where individuals or small groups scratched out mathematical formulas 
in their tiny offices, to that of “big science,” where worldwide collaborations of scientists 
conduct their affairs via telecon and Skype, spend budgets measured in units of megabucks 
or megaeuros, and require project managers to keep matters on track. 


' And a scoop of ice cream on top of the icing was the award of the 2017 Nobel Prize in Physics to three of 
LIGOs founders, Rainer Weiss, Barry Barish, and Kip Thorne. The other pioneer of the project, Ron Drever, 
had already died in March of that year. 
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The origins of this remarkable transformation of general relativity from an obscure niche 
of mathematics and physics to a major subfield of physics and astronomy, today called 
“Gravitational Physics,” can be found in a set of events of the academic year 1959-1960, 
beginning with that first radar echo from Venus. Four key events followed. 

On March 9, 1960, the editorial office of Physical Review Letters received a paper 
by Robert Pound and Glen Rebka Jr., entitled “Apparent Weight of Photons.” The paper 
reported the first successful laboratory measurement of the gravitational redshift of light. 
The paper was accepted and published in the April | issue. 

In June 1960, there appeared in volume 10 of the Annals of Physics a paper on “A Spinor 
Approach to General Relativity” by Roger Penrose. It outlined a streamlined calculus for 
general relativity based upon “spinors” rather than upon tensors. 

Later that summer, Carl Brans, a young Princeton graduate student working with Robert 
Dicke, began putting the finishing touches on his PhD thesis, entitled “Mach’s Principle 
and a Varying Gravitational Constant.” Part of that thesis was devoted to the development 
of a “scalar-tensor” alternative to the general theory of relativity. Although its authors never 
referred to it this way, it came to be known as the Brans-Dicke theory. 

On September 26, 1960, just over a year after the recorded Venus radar echo, 
astronomers Thomas Matthews and Allan Sandage and coworkers at Mount Palomar used 
the 200-inch telescope to make a photographic plate of the star field around the location 
of the radio source 3C48. Although they expected to find a cluster of galaxies, what they 
saw at the precise location of the radio source was an object that had a decidedly stellar 
appearance, an unusual spectrum, and a luminosity that varied on a timescale as short as 
fifteen minutes. The name quasistellar radio source or “quasar” was soon applied to this 
object and to others like it. 

These disparate and seemingly unrelated events of the academic year 1959-1960, in 
fields ranging from experimental physics to abstract theory to astronomy, signaled the 
beginning of a new era for general relativity. This era was to be one in which general 
relativity not only would become an important theoretical tool of the astrophysicist but 
also would have its validity challenged as never before. Yet it was also to be a time in 
which experimental tools would become available to test the theory in unheard-of ways 
and to unheard-of levels of precision. 

The optical identification of 3C48 (Matthews and Sandage, 1963) and the subsequent 
discovery of the large redshifts in its spectral lines and in those of 3C273 (Greenstein and 
Matthews, 1963; Schmidt, 1963) presented theorists with the problem of understanding 
the enormous output of energy from a region of space compact enough to permit the 
luminosity to vary systematically over timescales as short as days or hours. Many theorists 
tured to general relativity and to the strong relativistic gravitational fields it predicts, to 
provide the mechanism underlying such violent events. This was the first use of the theory’s 
strong-field aspect, in an attempt to interpret and understand observations. The subsequent 
discovery of the cosmic microwave background (CMB) radiation in 1964, of pulsars in 
1967, and of the first black hole candidate in 1971 showed that it would not be the last. 
However, the use of relativistic gravitation in astrophysical model building forced theorists 
and experimentalists to address the question: Is general relativity the correct relativistic 
theory of gravitation? It would be difficult to place much confidence in models for such 
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phenomena as quasars and pulsars if there were serious doubt about one of the basic 
underlying physical theories. Thus, the growth of “relativistic astrophysics” intensified the 
need to strengthen the empirical evidence for or against general relativity. 

The publication of Penrose’s spinor approach to general relativity (Penrose, 1960) was 
one of the products of a new school of relativity theorists that came to the fore in the late 
1950s. These relativists applied the elegant, abstract techniques of pure mathematics to 
physical problems in general relativity, and demonstrated that these techniques could also 
aid in the work of their more astrophysically oriented colleagues. The bridging of the gaps 
between mathematics and physics and mathematics and astrophysics by such workers as 
Bondi, Dicke, Sciama, Pirani, Penrose, Sachs, Ehlers, Misner, and others changed the way 
that research (and teaching) in relativity was carried out, and helped make it an active and 
exciting field of physics. Yet again the question had to be addressed: Is general relativity 
the correct basis for this research? 

The other three events of 1959-1960 contributed to the rebirth of a program to answer 
that question, a program of experimental gravitation that had been semi-dormant for forty 
years. 

The Pound-Rebka (1960) experiment, in addition to verifying the principle of equiva- 
lence and the gravitational redshift, demonstrated the powerful use of quantum technology 
in gravitational experiments of high precision. The next two decades would see further 
uses of quantum technology in such tools as atomic clocks, laser ranging, superconducting 
gravimeters, and gravitational-wave detectors, to name only a few. Recording radar 
echos from Venus (Smith, 1963) opened up the solar system as a laboratory for testing 
relativistic gravity. The rapid development of the interplanetary space program during 
the early 1960s made radar ranging to both planets and artificial satellites a vital new 
tool for probing relativistic gravitational effects. Coupled with the theoretical discovery 
in 1964 of the relativistic time-delay effect (Shapiro, 1964), it provided new and accurate 
tests of general relativity. For the next decade and a half, until the summer of 1974, the 
solar system would be the primary arena for high-precision tests of general relativity. 
Finally, the development of the Brans-Dicke (1961) theory provided a viable alternative 
to general relativity. Its very existence and agreement with the experimental results of the 
day demonstrated that general relativity was not a unique theory of gravity. Some even 
preferred it over general relativity on aesthetic and theoretical grounds. At the very least, it 
showed that discussions of experimental tests of relativistic gravitational effects should be 
carried on using a broader theoretical framework than that provided by general relativity 
alone. It also heightened the need for high-precision experiments because it showed that the 
mere detection of a small general relativistic effect was not enough. What was now required 
was measurements of these effects to accuracies of 10 percent, 1 percent, or fractions of a 
percent and better, to distinguish among competing theories of gravitation. 

To appreciate more fully the regenerative effect that these events had on gravitational 
theory and its experimental tests, it is useful to review briefly the history of general 
relativity in the forty-five years following Einstein’s publication of the theory. 

In deriving general relativity, Einstein was not particularly motivated by a desire to 
account for unexplained experimental or observational results. Instead, he was driven 
by theoretical criteria of elegance and simplicity. His primary goal was to produce a 
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gravitation theory that incorporated the principle of equivalence and special relativity in 
a natural way. In the end, however, he had to confront the theory with experiment. This 
confrontation was based on what came to be known as the “three classical tests.” 

One of these tests was an immediate success — the ability of the theory to account 
for the anomalous perihelion shift of Mercury. This had been an unsolved problem in 
celestial mechanics for over half a century, since the announcement by Urbain Jean Joseph 
Le Verrier in 1859 that, after the perturbing effects of the planets on Mercury’s orbit had 
been accounted for, there remained in the data an unexplained advance in the perihelion 
of Mercury. The modern value for this discrepancy is about forty-three arcseconds per 
century. A number of ad hoc proposals were made in an attempt to account for this excess, 
including the existence of a new planet, dubbed “Vulcan,” near the Sun, and a deviation 
from the inverse-square law of gravitation. A half century of astronomical searches for 
Vulcan yielded numerous claimed sightings, but in the end, no solid evidence for the planet 
was found. And while a change in the Newtonian inverse-square law proposed by Simon 
Newcombe, from the power 2 to the power 2.000000157, could account for the perihelion 
advance of Mercury, it ultimately conflicted with data on the motion of the Moon. 

Einstein was well aware of the problem of Mercury, and, in fact, he used it as a way 
to test his early attempts at a theory of gravity; for example, he finally rejected the 1912 
“Entwurf” or “draft” theory that he had developed with Marcel Grossmann in part because 
it gave the wrong perihelion advance. But when he thought he had obtained the final theory 
in November 1915, the fact that it gave the correct advance convinced him that he had 
succeeded. 

The next classical test, the deflection of light by the Sun, was not only a success, it was 
a sensation. Shortly after the end of World War I, two expeditions organized by Arthur 
Stanley Eddington set out from England: one for Sobral, in Brazil; and one for the island 
of Principe off the coast of Africa to observe the solar eclipse of May 29, 1919. Their goal 
was to measure the deflection of light as predicted by general relativity: 1.75 arcseconds 
for a ray that grazes the Sun. The observations had to be made in the path of totality of 
a solar eclipse, during which the Moon would block the light from the Sun and reveal 
the field of stars around it. Photographic plates taken of the star field during the eclipse 
were compared with plates of the same field taken when the Sun was not present, and the 
angular displacement of each star was determined. The results were 1.13 + 0.07 times the 
Einstein prediction for the Sobral expedition, and 0.92 + 0.17 for the Principe expedition 
(Dyson et al., 1920). The November 1919 announcement of these results confirming the 
theory caught the attention of a war-weary public and helped make Einstein a celebrity. 
Nevertheless, the experiments were plagued by systematic errors, and subsequent eclipse 
expeditions did little to improve the situation. 

The third classical test was actually the first proposed by Einstein (1908): the gravi- 
tational redshift of light. But by contrast with the other two tests, there was no reliable 
confirmation of it until the 1960 Pound-Rebka experiment. One possible test involved 
the red shift of spectral lines from the sun. A 1917 measurement by astronomer Charles 
St. John (1917) failed to detect the effect, sowing considerable doubt about the validity 
of the theory. Thirty years of such measurements revealed mainly that the observed shifts 
in solar spectral lines are dominated by Doppler shifts due to radial mass motions in the 
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solar photosphere, and by line shifts due to the high pressures in the solar atmosphere, 
making detection of the Einstein shift very difficult. It would be 1962 before a reliable solar 
redshift measurement would be made. Similarly inconclusive were attempts to measure 
the gravitational redshift of spectral lines from white dwarfs, primarily from Sirius B 
and 40 Eridani B, both members of binary systems. Because of uncertainties in the 
determination of the masses and radii of these stars, and because of possible complications 
in their spectra due to scattered light from their companions, reliable, precise measurements 
were not possible. Furthermore, by the late 1950s, it was being suggested that the 
gravitational red shift was not a true test of general relativity after all. According to Leonard 
Schiff and Robert Dicke, the gravitational red shift was a consequence purely of the 
principle of equivalence, and did not test the specific field equations of gravitational theory. 

Cosmology was one area where general relativity could conceivably be confronted 
with observation. Initially the theory met with success in its ability to account for the 
observed expansion of the universe, yet by the 1940s there was considerable doubt 
about its applicability. According to pure general relativity, the expansion of the universe 
originated in a dense primordial explosion called the “big bang.” However, at that time, 
the measured value of the expansion rate was so high that working backward in time using 
the cosmological solutions of general relativity led to the conclusion that the age of the 
universe was less than that of the Earth! One result of this doubt was the rise in popularity 
during the 1950s of the steady-state cosmology of Herman Bondi, Thomas Gold, and Fred 
Hoyle. This model avoided the big bang altogether, and allowed for the expansion of the 
universe by the continuous creation of matter. But by the late 1950s, revisions in the cosmic 
distance scale had reduced the expansion rate by a factor of five, and had thereby increased 
the age of the universe in the big bang model to a more acceptable level. Nevertheless, 
cosmology was still in its infancy, hardly suitable as an arena for testing theories of 
gravity. The era of “precision cosmology” would not begin until the launch of the Cosmic 
Background Explorer (COBE) satellite in 1989 followed by its precise measurements of 
the spectrum and fluctuations of the cosmic background radiation. 

Meanwhile, a small “cottage industry” had sprung up, devoted to the construction 
of alternative theories of gravitation. Some of these theories were produced by such 
luminaries as Henri Poincaré, Alfred North Whitehead, Edward Arthur Milne, George 
Birkhoff, Nathan Rosen, and Frederick Belinfante. Many of these authors expressed an 
uneasiness with the notions of general covariance and curved spacetime, which were 
built into general relativity, and responded by producing “special relativistic” theories 
of gravitation. Many of these theories considered spacetime itself to be governed by 
special relativity, and treated gravitation as a field on that background. As of 1960, it was 
possible to enumerate at least twenty-five such alternative theories, as found in the primary 
research literature between 1905 and 1960; for a partial list, see Whitrow and Morduch 
(1965). 

Thus, by 1960, it could be argued that the validity of general relativity rested on 
the following empirical foundation: one test of moderate precision (the perihelion shift, 
approximately 10 percent), one test of low precision (the deflection of light, approximately 
25 percent), one inconclusive test that was not a real test anyway (the gravitational 
redshift), and cosmological observations that could not distinguish between general 
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relativity and the steady-state theory. Furthermore, a variety of alternative theories laid 
claim to viability. 

In addition, the attitude toward the theory seemed to be that, whereas it was undoubtedly 
important as a fundamental theory of nature, its observational contacts were limited. This 
view was present for example in the standard textbooks on general relativity of this period, 
such as those by Moller (1952), Synge (1960), and Landau and Lifshitz (1962). As a 
consequence, general relativity was cut off from the mainstream of physics. It was during 
this period that one newly minted graduate of the California Institute of Technology was 
advised not to pursue this subject for his graduate work, because general relativity “had so 
little connection with the rest of physics and astronomy” (his name: Kip Thorne). 

However, the events of 1959-1960 changed all that. The pace of research in general 
relativity and relativistic astrophysics began to quicken and, associated with this renewed 
effort, the systematic high-precision testing of gravitational theory became an active and 
challenging field, with many new experimental and theoretical possibilities. These included 
new versions of old tests, such as the gravitational red shift and deflection of light, with 
accuracies that were unthinkable before 1960. They also included brand new tests of 
gravitational theory, such as the gyroscope precession, the time delay of light, and the 
“Nordtvedt effect” in lunar motion, all discovered theoretically after 1959. 

Because many of the experiments involved the resources of programs for interplanetary 
space exploration and observational astronomy, their cost in terms of money and manpower 
was high and their dependence upon increasingly constrained government funding agen- 
cies was strong. Thus, it became crucial to have as good a theoretical framework as possible 
for comparing the relative merits of various experiments, and for proposing new ones that 
might have been overlooked. Another reason that such a theoretical framework was neces- 
sary was to make some sense of the large (and still growing) number of alternative theories 
of gravitation. Such a framework could be used to classify theories, elucidate their simi- 
larities and differences, and compare their predictions with the results of experiments in a 
systematic way. It would have to be powerful enough to be used to design and assess exper- 
imental tests in detail, yet general enough not to be biased in favor of general relativity. 

A leading exponent of this viewpoint was Dicke (1964). It led him and others to perform 
several high-precision null experiments that greatly strengthened the empirical support for 
the foundations of gravitation theory. Within this viewpoint one asks general questions 
about the nature of gravity and devises experiments to test them. The most important 
dividend of the Dicke framework is the understanding that gravitational experiments can 
be divided into two classes. The first consists of experiments that test the foundations 
of gravitation theory, one of these foundations being the principle of equivalence. 
These experiments (E6tvés experiment, Hughes-Drever experiment, gravitational redshift 
experiment, and others) accurately verify that gravitation is a phenomenon of curved 
spacetime, that is, it must be described by a “metric theory” of gravity, at least to a 
high level of precision. General relativity and Brans-Dicke theory are examples of metric 
theories of gravity. 

The second class of experiments consists of those that test metric theories of gravity. 
Here another theoretical framework was developed that takes up where the Dicke 
framework leaves off. Known as the “Parametrized post-Newtonian” or PPN formalism, 
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it was pioneered by Kenneth Nordtvedt Jr. (1968b), and later extended and improved 
by Will (1971c), Will and Nordtvedt (1972), and Will (1973). The PpN framework takes 
the slow motion, weak field, or post-Newtonian limit of metric theories of gravity, and 
characterizes that limit by a set of ten real-valued parameters. Each metric theory of gravity 
has particular values for the PPN parameters. The PPpN framework was ideally suited to 
the analysis of solar system gravitational experiments, whose task then became one of 
measuring the values of the PPN parameters and thereby delineating which theory of gravity 
is correct. A second powerful use of the PPN framework was in the discovery and analysis 
of new tests of gravitation theory, examples being the Nordtvedt effect (Nordtvedt, 1968a), 
preferred-frame effects (Will, 1971b), and preferred-location effects (Will, 1971b, 1973). 
The Nordtvedt effect, for instance, is a violation of the equality of acceleration of massive 
bodies, such as the Earth and Moon, in an external field; the effect is absent in general 
relativity but present in many alternative theories, including the Brans-Dicke theory. The 
third use of the PPN formalism was in the analysis and classification of alternative metric 
theories of gravitation. After 1960, the invention of alternative gravitation theories did not 
abate but changed character. The crude attempts to derive Lorentz-invariant field theories 
described previously were mostly abandoned in favor of metric theories of gravity, whose 
development and motivation were often patterned after that of the Brans-Dicke theory. 
A “theory of gravitation theories” was developed around the PPN formalism to aid in 
their systematic study. The pPN formalism thus became the standard theoretical tool for 
analyzing solar system experiments, looking for new tests, and studying alternative metric 
theories of gravity. 

But by the middle 1970s it became apparent that the solar system could no longer be the 
sole testing ground for gravitation theories. The reason was that many alternative theories 
of gravity agreed with general relativity in their weak-field, slow-motion limits closely 
enough to pass all solar system tests. But they did not necessarily agree in other predictions, 
such as neutron stars, black holes, gravitational radiation, or cosmology, phenomena that 
involved strong or dynamical gravity. 

This was confirmed in the summer of 1974 with the discovery by Joseph Taylor and 
Russell Hulse of the binary pulsar (Hulse and Taylor, 1975). Here was a system that 
featured, in addition to significant post-Newtonian gravitational effects, highly relativistic 
gravitational fields associated with the pulsar (and possibly its companion) and the 
possibility of the emission of gravitational radiation by the binary system. The role of 
the binary pulsar as a new arena for testing relativistic gravity was confirmed four years 
later with the announcement (Taylor et al., 1979) that the rate of change of the orbital 
period of the system had been measured. The result agreed with the prediction of general 
relativity for the rate of orbital energy loss due to the emission of gravitational radiation. 
But it disagreed strongly with the predictions of many alternative theories, even some with 
post-Newtonian limits identical to that of general relativity. 

By 1981, when the first edition of this book was published, it was not uncommon to 
describe the period 1960-1980 as a “golden era” for experimental gravity. Many of the 
events of that period were described for a lay audience in my 1986 book Was Einstein 
Right? (Will, 1986). But the phrase “golden era” suggests that it was downhill from that 
time forward. Quite the opposite was true. 
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Solar-system tests of relativistic gravity continued, with highlights including dramati- 
cally improved measurements of light deflection and the Shapiro time delay, measurements 
of “frame-dragging” by the Gravity Probe B and the Laser Geodynamics Satellite 
(LAGEOS) experiments, and steadily improving lunar laser ranging. Binary pulsar tests 
continued, aided by remarkable discoveries, including the famous “double pulsar” and a 
pulsar in a triple system. 

At the same time, the central thrust of testing gravity began to shift away from the weak- 
field limit. Two themes began to emerge as the key themes for the future. 

The first theme is Dynamical Gravity. This involves phenomena in which the variation 
with time of the spacetime geometry plays an important role. In the solar system, velocities 
are small compared to the speed of light and the masses of the planets are small compared 
to the mass of the sun, so the underlying spacetime geometry can be viewed either as being 
stationary or as evolving in a quasistationary manner. But in the binary pulsar, for example, 
the two bodies have almost the same mass and are orbiting each other ten times faster 
than planets in the solar system, and consequently the varying spacetime geometry that 
they both generate devolves into gravitational waves propagating away from the system, 
causing it to lose energy. A more dramatic example is the final inspiral of the two black 
holes whose gravitational signal was detected by LIGO in 2015. The black holes are made 
of pure curved spacetime, and the manner in which that geometry evolved during the 
final fractions of a second of the inspiral and merger left its imprint on the gravitational 
waves that were detected. The final black hole that was left over even oscillated a few 
times, emitting a specific kind of gravitational radiation called ringdown waves. This is the 
regime of dynamical gravity. Dynamical gravity often goes hand in hand with gravitational 
radiation. 

The second theme is Strong Gravity. Much like modern art, the term “strong” means 
different things to different people. To someone steeped in general relativity, the principal 
figure of merit that distinguishes strong from weak gravity is the quantity « ~ Gm/c’r, 
where m is the characteristic mass scale of the phenomenon, r is the characteristic distance 
scale, and G and c are the Newtonian gravitational constant and the speed of light, 
respectively. Near the event horizon of a nonrotating black hole, or for the expanding 
observable universe, « ~ 1; for neutron stars, € ~ 0.2. These are the regimes of strong 
gravity. For the solar system, ¢ < 10~>; this is the regime of weak gravity. 

An alternative view of “strong” gravity comes from the world of particle physics. Here 
the figure of merit is Gm/c’r? ~ ¢~*, where the curvature of spacetime associated 
with the phenomenon, represented by the left-hand side, is comparable to the inverse 
square of a favorite length scale @. If ¢ is the Planck length (hG/c?)!/? ~ 10~*5 m, this 
would correspond to the regime where one expects conventional quantum gravity effects 
to come into play. Another possible scale for @ is the TeV scale associated with many 
models for unification of the forces, or models with extra spacetime dimensions. From this 
viewpoint, strong gravity is where the radius of curvature of spacetime is comparable to 
the fundamental length. Weak gravity is where the radius of curvature is much larger than 
this. The universe at the Planck time is strong gravity. Just outside the event horizon of an 
astrophysical black hole is weak gravity. 

We will adopt the relativist’s view of strong gravity. 
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The boundary between dynamical gravity and strong gravity is somewhat fuzzy. One can 
explore strong gravity alone by studying the motion of a star around a static supermassive 
black hole or of gas around a neutron star. Gravitational waves can be emitted by a binary 
system of white dwarfs, well characterized by weak gravity. However, the strongest waves 
tend to come from systems with compact, strongly gravitating bodies, because only such 
bodies can get close enough together to reach the relativistic speeds required to generate 
strong gravitational waves. And the universe as a whole can be thought of as both “strong 
gravity” and dynamical, yet because of the high degree of symmetry, gravitational waves 
do not play a major role in its evolution. By contrast, primordial gravitational waves could 
be detectable, in fluctuations of the cosmic background radiation, for example. Regardless 
of the specific context, testing general relativity in the strong-field and dynamical regimes 
will dominate this field for some time to come. 

Asa young student of seventeen at the Polytechnical Institute of Ziirich, Einstein studied 
the work of Helmholtz, Maxwell, and Hertz, and ultimately used his deep understanding 
of electromagnetic theory as a foundation for special and general relativity. He appears 
to have been especially impressed by Hertz’s confirmation that light and electromagnetic 
waves are one and the same (Schilpp, 1949). The electromagnetic waves that Hertz studied 
were in the radio part of the spectrum, at 30 MHz. It is amusing to note that, sixty years 
later, the “golden age” for testing relativistic gravity began with radio waves, the 440 MHz 
waves reflected from Venus, and ended with radio waves, the pulsed signals from the binary 
pulsar, observed at 430 MHz. We are now in a new era for testing general relativity, an era 
in which we can exploit and study an entirely new kind of wave, a wave in the fabric of 
spacetime itself. 

During the half-century that closed on the centenary of Einstein’s formulation of general 
relativity, the empirical foundations of his great theory were strengthened as never before. 
The question then arises, why bother to continue to test it? One reason is that gravity is a 
fundamental interaction of nature, and as such requires the most solid empirical underpin- 
ning we can provide. Another is that all attempts to quantize gravity and to unify it with 
the other forces suggest that the standard general relativity of Einstein may not be the last 
word. Furthermore, the predictions of general relativity are fixed; the pure theory contains 
no adjustable constants, so nothing can be changed. Thus every test of the theory is either 
a potentially deadly test or a possible probe for new physics. Although it is remarkable 
that this theory, born 100 years ago out of almost pure thought, has managed to survive 
every test, the possibility of finding a discrepancy will continue to drive experiments for 
years to come. These experiments will search for new physics beyond Einstein in many 
different directions: the large distance scales of the cosmological realm; scales of very 
short distances or high energy; and the realms of strong and dynamical gravity. 


é Units and conventions 


Throughout this book, we will adopt the units and conventions of standard textbooks such as Misner, Thorne, 
and Wheeler (1973) (hereafter referred to as MTW) or Schutz (2009). For a pedagogical development of many 
of the topics presented here, such as Newtonian gravity, post-Newtonian theory, and gravitational radiation, 
we will refer readers to Poisson and Will (2014) (hereafter referred to as PW). Although we have attempted to 
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produce a reasonably self-contained account of gravitation theory and gravitational experiments, the reader's 
path will be greatly smoothed by a familiarity with general relativity at the level of one of these texts. 

We will use “geometrized units,’in which G = c = 1 (exceptin Chapter 2) andin which mass and time have 
the same units as distance. Greek indices on vectors and tensors will run over the four spacetime dimensions, 
while Latin indices will run only over spatial dimensions. We will use the Einstein summation convention, 
in which one sums repeated indices over their range. Multi-index objects, such as products x/x*x! . . . will be 
denoted using capital superscripts, e.g.,.x", where Nis the number of indices. Partial derivatives and covariant 
derivatives will be denoted by commas and semicolons preceding indices, respectively. Parentheses enclosing 
indices will denote symmetrization, while square brackets will denote antisymmetrization. 


The Einstein Equivalence Principle 


The Principle of Equivalence has played a central role in the development of gravitation 
theory. Newton regarded this principle as such a cornerstone of mechanics that he devoted 
the opening paragraphs of his masterwork Philosophiae Naturalis Principia Mathematica 
(often called the “Principia’”’) to a detailed discussion of it (Newton, 1686). On page 1, 
Definition 1, he defined the “quantity” of matter to be its mass, and also defined the 
“weight” of a body. He asserted that the mass “is proportional the weight, as I have found 
by experiments on pendulums, very accurately made, which shall be shown hereafter.” To 
Newton, the Principle of Equivalence demanded that the “mass” of any body, namely that 
property of a body (inertia) that regulates its response to an applied force, be equal to its 
“weight,” that property that regulates its response to gravitation. Bondi (1957) coined the 
terms “inertial mass” m , and “passive gravitational mass” mp, to refer to these quantities, 
so that Newton’s second law and the law of gravitation take the forms 


F=ma, F=mpg, (2.1) 


where g is the gravitational field. The Principle of Equivalence can then be stated 
succinctly: for any body,! 


Mp =m. (2.2) 


An alternative statement of this principle is that all bodies fall in a gravitational field with 
the same acceleration regardless of their mass or internal structure. Newton’s equivalence 
principle is now generally referred to as the “Weak Equivalence Principle” (WEP). 

It was Einstein who added the key element to WEP that revealed the path to general 
relativity. If all bodies fall with the same acceleration in an external gravitational field, 
then to an observer in a freely falling elevator in the same gravitational field, the bodies 
should be unaccelerated, except for possible tidal effects due to inhomogeneities in 
the gravitational field. Tidal effects can be made as small as one pleases by confining 
everything a sufficiently small elevator. Thus, insofar as their mechanical motions are 
concerned, the bodies will behave as if gravity were absent. Einstein went one step further. 
He proposed that not only should mechanical laws behave in such an elevator as if gravity 
were absent but also so should all the laws of physics, including, for example, the laws of 
electrodynamics. This new principle led Einstein to general relativity. It is now called the 
“Einstein Equivalence Principle” (EEP). 

Yet, it was only in the 1960s that we gained a deeper understanding of the significance of 
these principles of equivalence for gravitation and experiment. Largely through the work 


' Although Newton asserted only that mp and my be proportional to each other, they can be made equal by 
suitable choice of units for a and g 
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of Robert Dicke, we have come to view principles of equivalence, along with experiments 
such as the EétvGs experiment and the gravitational redshift experiment, as probes more of 
the foundations of gravitation theory than of general relativity itself. This viewpoint is part 
of what has come to be known as the Dicke Framework, to be described in Section 2.1, 
allowing one to discuss at a very fundamental level the nature of spacetime and gravity. 
Within this framework one asks questions such as: Do all bodies respond to gravity with the 
same acceleration? Does energy conservation imply anything about gravitational effects? 
What types of fields, if any, are associated with gravitation — scalar fields, vector fields, 
tensor fields ...? In Section 2.2, we argue that the Einstein Equivalence Principle is the 
foundation for all gravitation theories that describe gravity as a manifestation of curved 
spacetime, the so-called metric theories of gravity. In Section 2.3 we describe the empirical 
support for EEP from a variety of experiments. 

Einstein’s generalization of the Weak Equivalence Principle may not have been a 
generalization at all, according to a conjecture based on the work of Leonard Schiff. 
In Section 2.4 we discuss Schiff’s conjecture, which states that any complete and self- 
consistent theory of gravity that satisfies WEP necessarily satisfies EEP. Schiff’s conjecture 
and the Dicke Framework have spawned a number of concrete theoretical formalisms for 
comparing and contrasting metric theories of gravity with nonmetric theories, for analyzing 
experiments that test EEP, and for proving Schiff’s conjecture. These include the THey 
and c* formalisms, presented in Section 2.5, and the Standard Model Extension (SME), 
discussed in Section 2.6. 

What would happen if a violation of one of these principles were observed? One 
possibility is that the entire edifice of metric theories of gravity, including general relativity, 
would come tumbling down. Another possibility is that the apparent violation would 
actually signal the presence of some field or interaction that lies outside the standard 
model of strong, electromagnetic and weak interactions, plus gravity. This latter viewpoint 
has proven to be fruitful, exploiting the ultrahigh precision apparatus developed to test 
equivalence principles in order to search for and ultimately place limits on new physics. 
We will describe a few examples of this approach in Section 2.7. 


2.1 The Dicke Framework 


The Dicke Framework for analyzing experimental tests of gravitation was spelled out in 
appendix 4 of Dicke’s Les Houches lectures (Dicke, 1964). It makes two main assumptions 
about the type of mathematical formalism to be used in discussing gravity: 


1. Spacetime is a four-dimensional differentiable manifold, with each point in the manifold 
corresponding to a physical event. The manifold need not a priori have either a metric 
or an affine connection. The hope is that experiment will force us to conclude that it 
has both. 

2. The equations of gravity and the mathematical entities in them are to be expressed in a 
form that is independent of the particular coordinates used, i.e., in covariant form. 


B 


2.1 The Dicke Framework 


Notice that even if there is some physically preferred coordinate system or reference 
frame in spacetime, the theory can still be put into covariant form. For example, if a 
theory has a preferred cosmic time coordinate, one can introduce a scalar field 7(P), whose 
numerical values are equal to the values of the preferred time ¢ according to T(P) = ¢(P), 
where P is a point in spacetime. If spacetime is endowed with a metric, one might also 
demand that VT be a timelike vector field and be consistently oriented toward the future 
(or the past) throughout spacetime by imposing the covariant constraints VT- VT < 0 
and V ® VT = 0. where V is a covariant derivative with respect to the metric. Other 
types of theories have “flat background metrics” 7; these can also be written covariantly 
by defining 77 to be a second-rank tensor field whose Riemann tensor vanishes everywhere, 
that is, Riem (7) = 0 and by defining covariant derivatives and contractions with respect 
to 7. In most cases, this covariance is achieved at the price of the introduction into the 
theory of “absolute” or “prior geometric” elements (7,7), that are not determined by 
the dynamical equations of the theory. Some authors regard the introduction of absolute 
elements as a failure of general covariance (Einstein would be one example), however we 
will adopt the weaker assumption of coordinate invariance alone. (For further discussion 
of prior geometry, see Section 3.3.) 

Having laid down this mathematical viewpoint, Dicke then imposes two constraints on 
all acceptable theories of gravity. They are: 


1. Gravity must be associated with one or more fields of tensorial character (scalars, 
vectors, and tensors of various ranks). 

2. The dynamical equations that govern gravity must be derivable from an invariant action 
principle. 


These constraints strongly confine acceptable theories. For this reason we should accept 
them only if they are fundamental to our subsequent arguments. For most applications of 
the Dicke Framework only the first constraint is needed. It is a fact, however, that the most 
successful gravitation theories, and a// theories of current interest, are those that satisfy 
both constraints. 

The Dicke Framework is particularly useful for asking questions such as what types of 
fields are associated with gravity, and how do they interact with the fundamental fields of 
the standard model of electromagnetic, weak and strong interactions. For example, there is 
strong evidence from elementary particle physics for at least one symmetric second-rank 
tensor field that is approximated by the Minkowski metric 7 when gravitational effects can 
be ignored. The Hughes-Drever experiment rules out or strongly constrains the existence 
of more than one second-rank tensor field, each coupling directly to matter, and various 
laboratory tests of Lorentz invariance rule out a long-range vector field coupling directly 
to matter. However, this is not the only powerful use of the Dicke Framework. 

The general unbiased viewpoint embodied in the Dicke Framework has allowed theorists 
to formulate a set of fundamental criteria that any gravitation theory should satisfy if it is 
to be viable [here we do not impose constraints (1) and (2) above]. Two of these criteria 
are purely theoretical, whereas two are based on experimental evidence. 

(i) It must be complete, that is, it must be capable of analyzing from “first principles” 
the outcome of any experiment of interest. It is not enough for the theory to postulate 
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that bodies made of different material fall with the same acceleration. The theory must 
incorporate a complete set of electrodynamic and quantum mechanical laws, which can 
be used to calculate the detailed behavior of real bodies composed of nucleons and 
electrons in gravitational fields. This demand should not be extended too far, however. 
In areas such as quantum gravity, unification with the standard model of particle physics, 
spacetime singularities, and cosmic initial conditions, even special and general relativity 
are not regarded as being complete or fully developed. We also do not regard the presence 
of “absolute elements” and arbitrary parameters in gravitational theories as a sign of 
incompleteness, even though they are generally not derivable from “first principles,” rather 
we view them as part of the class of cosmic boundary conditions. The most common 
and successful way of formulating a “complete” theory is to use an action principle that 
combines the standard model action (as it is currently known) for the nongravitational 
sector with an action for the gravitational “fields” (including the spacetime metric), 
together with some coupling between them. 

(i1) It must be self-consistent, that is, its prediction for the outcome of every experiment 
must be unique. When one calculates predictions by two different, though equivalent 
methods, one must get the same results. An example is the bending of light computed 
either in the geometrical optics limit of Maxwell’s equations or in the zero-rest-mass limit 
of the motion of test particles. 

(iii) It must be relativistic, that is, in the limit as gravity is “turned off’ compared to 
other physical interactions, the nongravitational laws of physics must reduce to the laws 
of special relativity, either perfectly or to a high degree of precision. The evidence for this 
comes from more than a century of successes of special relativity in areas ranging from 
high-energy physics to atomic physics (see Box 2.1). This does not necessarily imply a 
blind or perfect acceptance of Lorentz invariance and special relativity, and in fact vigorous 
experimental searches for potential violations of Lorentz invariance are continuing, in part 
to search for relic signatures of quantum gravity or of weak cosmic fields that couple to 
matter (see Section 2.6). 

The fundamental theoretical object that enters these laws is the Minkowski metric 7, 
which has orthonormal tetrads related by Lorentz transformations, and which determines 
the ticking rates of atomic clocks and the lengths of laboratory rods. If we view 77 as a 
field, then we conclude that there must exist at least one second-rank tensor field in the 
Universe, a symmetric tensor ~, that is well approximated by 7 when gravitational effects 
can be ignored. 

Let us examine what the evidence for special relativity does and does not tell us about the 
tensor field a. First, it does not guarantee the existence of global Lorentz frames, that is, 
coordinate systems extending throughout spacetime in which w = 7 = diag(—1, 1, 1, 1). 
Nor does it demand that at each event P, there exist local frames related by Lorentz 
transformations, in which the laws of nongravitational physics take on their special 
relativistic forms. Special relativity only demands that, in the limit as gravity is “turned 
off” the nongravitational laws of physics reduce to the laws of special relativity. 

We will henceforth assume the existence of the tensor field a. 

(iv) It must have the correct Newtonian limit, that is, in the limit of weak gravitational 
fields and slow motions, it must reproduce Newton’s laws. The overwhelming majority of 
phenomena in the universe can be very adequately described by the laws of Newtonian 
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Tests of special relativity 


Special relativity has been so thoroughly integrated into the fabric of modern physics that its validity is rarely 
challenged, except by cranks and crackpots. But we should remember that it does rest on a strong empirical 
foundation, including a number of classic tests. 

The Michelson-Morley (1887) experiment and its many descendents (Shankland et al., 1955; Champeney 
etal., 1963; Jaseja et al., 1964; Brillet and Hall, 1979; Riis et al., 1988; Krisher et al., 1990b) failed to find evidence 
of a variation of the speed of light with the Earth’s velocity through a hypothetical “aether.” 

Several classic experiments were performed to verify that the speed of light is independent of the speed 
of the emitter. If the speed of light were given by c + kv, where v is the velocity of the emitter, and k is a 
parameter to be measured, then orbits of binary-star systems would appear to have an anomalous eccentricity 
unexplainable by normal Newtonian gravity. This test is not unambiguous at optical wavelengths, however, 
because light is absorbed and reemitted by the intervening interstellar medium, thereby losing the memory 
of the speed of the source, a phenomenon known to astronomers as extinction. But at X-ray wavelengths, the 
path length of extinction is tens of kiloparsecs, so Brecher (1977) used three nearby X-ray binary systems in 
our galaxy to obtain a bound |k| < 2 x 10°, for typical orbital velocities v/c ~ 1073. 

At the other extreme, a 1964 experiment at CERN used neutral pions moving at v/c > 0.99975 as the 
source of light. Photons produced by the decay +° —> -y + ~y were collimated and timed over a flight path 
of 30 meters. The agreement of the photons’ speed with the laboratory value set a bound |k| < 10~‘ for 
v = ¢(Alvager et al., 1964). 

The observational evidence for time dilation is overwhelming. Ives and Stilwell (1938) measured the 
frequency shifts of radiation emitted in the forward and backward direction by moving ions of H) and H3 
molecules. The first-order Doppler shift cancels out from the sum of the forward and backward shifts, revealing 
the second-order time-dilation effect, which was found to agree with theory. (Ironically, Ives was a die-hard 
opponent of special relativity.) 

The classic Rossi-Hall (1941) experiment showed that the lifetime of j1-mesons was prolonged by the 
Lorentz factor -y = (1 — v?/c’)~'/2, Muons are created in the upper atmosphere when cosmic-ray 
protons collide with nuclei of air, producing pions, which decay quickly to muons. With a rest half-life of 
2.2 x 10~°s, and with no time dilation, a muon travelling near the speed of light should travel only 2/3 
of a kilometer on average before decaying to a harmless electron or positron and two neutrinos. Yet muons 
are the primary component of cosmic rays detected at sea level. Rossi and Hall measured the distribution of 
muons as a function of altitude and also measured their energies, and confirmed the time-dilation formula. 

In an experiment performed in 1966 at CERN, muons in a storage ring moving at v/c = 0.997 were 
observed to have lifetimes 12 times larger than muons at rest, in agreement with the prediction to 2 percent 
(Farley et al., 1966). Also, since the storage ring was 5 meters in diameter, the muons’ accelerations were 
greater than the gravitational acceleration on the Earth’s surface by a factor of 10'°; these accelerations had 
no apparent effect on their decay rates. 

The incorporation of Lorentz invariance into quantum mechanics provided further support for special 
relativity. The achievements include the prediction of anti-particles and elementary particle spin, and the 
many successes of relativistic quantum field theory. 

Fora pedagogical review written on the occasion of the 2005 centenary of special relativity, see Will (2006). 
We will describe contemporary tests of what is today called “Local Lorentz Invariance” in Section 2.3.2. 
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gravity. To a high degree of accuracy, Newton rules the Sun, the Earth, the solar system, 
all normal stars, galaxies and clusters of galaxies. On laboratory scales, the inverse-square 
law of Newton has been validated down to distances as small as 100 microns. 

Thus, to have a shot at being viable, a gravitation theory must be complete, self- 
consistent, relativistic, and compatible with Newtonian gravity. 


2.2 The Einstein Equivalence Principle 
——————————————————————————————————————————————————— 


The Einstein Equivalence Principle is the foundation of all curved spacetime or “metric” 
theories of gravity, including general relativity. It is a powerful tool for dividing grav- 
itational theories into two distinct classes: metric theories, those that embody EEP, and 
nonmetric theories, those that do not embody EEP. For this reason, we will discuss it in 
some detail and devote the next section to the experimental evidence that supports it. 

We begin by stating the Weak Equivalence Principle in more precise terms than those 
used before. WEP states that ifan uncharged test body is placed at an initial event in space- 
time and given an initial velocity there, then its subsequent trajectory will be independent of 
its internal structure and composition. By “uncharged test body” we mean an electrically 
neutral body that has negligible self-gravitational energy (as estimated using Newtonian 
theory) and that is small enough in size so that its coupling to inhomogeneities in external 
fields can be ignored. In the same spirit, it is also useful to define a “local nongravitational 
test experiment” to be any experiment: (i) performed in a freely falling laboratory that 
is shielded and is sufficiently small that inhomogeneities in the external fields can be 
ignored throughout its volume and (ii) in which self-gravitational effects are negligible. 
For example, a measurement of the fine structure constant is a local nongravitational test 
experiment; a Cavendish experiment to measure Newton’s constant G is not. 

The Einstein Equivalence Principle then states: 


1. WEP is valid. 

2. The outcome of any local nongravitational test experiment is independent of the velocity 
of the (freely falling) apparatus. 

3. The outcome of any local nongravitational test experiment is independent of where and 
when in the universe it is performed. 


This principle is at the heart of gravitation theory, for it is possible to argue convincingly 
that if EEP is valid, then gravitation must be a curved-spacetime phenomenon, that is, must 
satisfy the postulates of Metric Theories of Gravity. These postulates state: 


1. Spacetime is endowed with a metric. 

2. The world lines of test bodies are geodesics of that metric. 

3. In local freely falling frames, the nongravitational laws of physics are those of special 
relativity. 


The argument proceeds as follows. The validity of WEP endows spacetime with a family 
of preferred trajectories, the world lines of freely falling test bodies. In a local frame that 
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follows one of these trajectories, test bodies have unaccelerated motions. Furthermore, the 
results of local nongravitational test experiments are independent of the velocity of the 
frame. In two such frames located at the same event P in spacetime, but moving relative 
to each other, all the nongravitational laws of physics must make the same predictions 
for identical experiments, that is, they must be Lorentz invariant. We call this aspect of 
EEP Local Lorentz Invariance (LLI). Therefore, there must exist in the universe one or 
more second-rank tensor fields op), ap), ... that reduce in a local freely falling frame 
to fields that are proportional to the Minkowski metric, 6) (P)n, ¢@ (P)n, ..., where 
@(P) are scalar fields that can vary from event to event. Different members of this 
set of fields may couple to different nongravitational fields, such as boson fields, fermion 
fields, electromagnetic fields, and so on. However, the results of local nongravitational 
test experiments must also be independent of the spacetime location of the frame. We 
call this Local Position Invariance (LPI). There are then two possibilities. (i) The local 
versions of yp) have constant coefficients, that is, the scalar fields 6“) (P) are actually 
constants, independent of P. It is therefore possible by a simple universal rescaling of 
coordinates and coupling constants (such as the unit of electric charge) to set each scalar 
field equal to unity in every local frame. (ii) The scalar fields 64) (P) must be constant 
multiples of a single scalar field ¢(P), that is, 6“) (P) = c4¢(P). If this is true, then 
physically measurable quantities, being dimensionless ratios, will be location independent 
(essentially, the scalar field will cancel out). One example is a measurement of the fine 
structure constant; another is a measurement of the length of a rigid rod in centimeters, 
since such a measurement is a ratio between the length of the rod and that of a standard rod 
whose length is defined to be one centimeter. Thus, a combination of a rescaling of coupling 
constants to set the c4’s equal to unity (redefinition of units), together with a “conformal” 
transformation to a new field y = @ 'ab guarantees that the local version of 
will be 7. 

In either case, we conclude that there exist fields that reduce to 7) in every local freely 
falling frame. Elementary differential geometry then shows that these fields are one and the 
same: a unique, symmetric second-rank tensor field that we now denote by “the metric” g. 
This g has the property that it possesses a family of preferred worldlines called geodesics, 
and that at each event P there exist local frames, called local Lorentz frames, that follow 
these geodesics, in which 


Suv(P) = Nv s "Ss (P) =0. (2.3) 
x 

However, geodesics are straight lines in local Lorentz frames, as are the trajectories of test 
bodies in local freely falling frames, hence the test bodies move on geodesics of g and 
the Local Lorentz frames coincide with the freely falling frames. The idea that a single 
field g interacts with all the nongravitational fields in a unique manner is often called 
“universal coupling.” We will discuss the implications of the postulates of metric theories 
of gravity and of universal coupling in more detail in Chapter 3. Because EEP is so crucial 
to this conclusion about the nature of gravity, we turn now to the supporting experimental 
evidence. 
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Choice of units 


Throughout this chapter we will avoid units in which c = 1. The reason for this is that if EEP is not valid then 
the speed of light may depend on the nature of the devices used to measure it. Thus, to be precise we should 
denote c as the speed of light as measured by some standard experiment. However, once we accept the validity 
of EEP in Chapter 3 and beyond, then c has the same value in every local Lorentz frame, independently of the 
method used to measure it, and thus c can be set equal to unity by an appropriate choice of units. 


2.3 Experimental Tests of the Einstein Equivalence Principle 
A aS 


2.3.1 Tests of the Weak Equivalence Principle 


A direct test of WEP is the comparison of the acceleration of two laboratory-sized bodies 
of different composition in an external gravitational field, today often called the Edtvés 
experiment. If WEP were invalid, then the accelerations of different bodies would differ. 
The simplest way to quantify such possible violations of WEP in a phenomenological form 
suitable for comparison with experiment is to suppose that for a body of inertial mass my, 
the passive mass mp is no longer equal to m;. Now the inertial mass of a typical laboratory 
body is made up of several types of mass energy: rest energy of the constituent particles, 
electromagnetic energy, weak-interaction energy, strong-interaction energy, and so on. If 
one of these forms of energy contributes to mp differently than it does to my , a violation of 
WEP would result. One could then write 


a3 se? 
mp =m + >> n a (2.4) 
A 


where E* is the internal energy of the body generated by interaction A, 7“ is a 
dimensionless parameter that measures the strength of the violation of WEP induced by 
that interaction, and c is the speed of light. For two bodies, the acceleration is then given by 


, EA , EA 
=(1 ={1 ae. 2.5 
a oe, mays  ® rea male (2.5) 


where we have dropped the subscript “I” on m, and mp. 
A measurement of the relative difference in acceleration then yields a quantity called the 
“E6tvés ratio” given by 


2\ay — ap| A ea E> 
= _ : 2.6 
a la, +. ap| x. mc? myc? -) 


Thus, experimental limits on 77 place limits on the WEP-violation parameters 77°. 

One way to place a bound on 77 is to drop two different objects in the Earth’s gravitational 
field and compare how long they take to fall. Legend has it that Galileo Galilei originated 
and verified WEP by dropping objects off the Leaning Tower of Pisa around 1590. In fact, 
the principle of equal acceleration had already been asserted as early as 400 CE by Ioannes 
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Philiponus, and more recently by Benedetti in 1553, and experimental tests were performed 
by Stevin in 1586. If Galileo did indeed drop things off the Tower (and the only information 
about this comes from his last student and biographer Viviani, who wasn’t even born until 
1622), he may simply have been performing a kind of classroom demonstration of an 
established fact for his students. Unfortunately, this approach is plagued by experimental 
errors, such as the difficulty of releasing the objects at exactly the same time, by the effects 
of air drag, and by the short time available for timing the drop. 

A better approach is to balance the gravitational force (which depends on mp) by a 
support force (which depends on mj); the classic model is the pendulum experiments 
performed by Newton and reported in his Principia; improved versions were carried out 
by Bessel (1832) and Potter (1923). The period of the pendulum depends on mp /my, g, and 
the length of the pendulum. These experiments are also troubled by air drag, by errors in 
measuring or controlling the length of the pendulum, and by errors in timing the swing. 

The best approach for laboratory tests was pioneered by Baron Roland von Eotvés, a 
Hungarian geophysicist working around the turn of the 20th century. He developed the 
torsion balance, schematically consisting of a rod suspended by a wire near its mid-point, 
with objects comprised of different materials attached at each end. The point where the 
wire is attached to achieve a horizontal balance depends only on the gravitational masses 
of the two objects, so this configuration does not tell us anything. But if an additional 
gravitational force can be applied in a direction not parallel to the supporting wire, then 
there will be a torque induced on the wire, given by 


N x r(g X @w)-e, (2.7) 


where r is the length of the rod, g is the gravitational acceleration, and e,, and e, are unit 
vectors along the wire and the rod, respectively (see Figure 2.1). If the entire apparatus is 
rotated about some direction with angular velocity w, the torque will be modulated with 
period 27/w. 

In the experiments of Eétvés and his collaborators (E6tvés et al., 1922), the wire and 
g were not quite parallel by about 0.1° because of the centripetal acceleration on the 
apparatus due to the Earth’s rotation; the apparatus was rotated about the direction of the 
wire. In the classic experiments by the groups headed by Dicke and Braginsky (Roll et al., 
1964; Braginsky and Panov, 1972), g was that of the Sun, and the rotation of the Earth 
provided the modulation of the torque at a period of 24 hr. During the late 1980s, the 
search for an intermediate-range “fifth force” (Section 2.7) exploited nearby masses such 
as cliffs, mountains, and water reservoirs (for a bibliography of experiments up to 1991, see 
Fischbach et al. (1992)). In the quest for higher precision, the standard torsion balance rod 
was replaced by a horizontal tray on which masses of different composition were carefully 
fabricated and arranged so as to suppress torques due to normal quadrupole and higher 
moment coupling between the masses and nearby laboratory apparatus (including the 
experimenters). The “Eot-Wash” experiments carried out at the University of Washington 
used such sophisticated arrangements to compare the accelerations of various materials 
toward local topography on Earth, movable laboratory masses, the Sun and the galaxy, and 
established a bound on the Eétvés parameter at the level of 2 x 107! (Su et al., 1994; 
Baefiler et al., 1999; Adelberger, 2001; Schlamminger et al., 2008; Wagner et al., 2012). 
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Schematic arrangement of a torsion-balance E6tvds experiment. 


Galileo has not been completely forgotten however: in a “free-fall” experiment per- 
formed at the University of Colorado, the relative acceleration in a drop tower of two 
bodies made of uranium and copper was measured using a laser interferometric technique 
(Niebauer et al., 1987). 

Going into space makes possible a “perpetual” Galileo experiment in which the two 
bodies never hit the ground, permitting long integration times as a way to reduce errors. 
An example is MICROSCOPE, designed to test WEP to parts in 10!5. Developed by the 
French space agency CNES, it was launched on April 25, 2016, for a two-year mission. 
The two masses are nested coaxial cylinders made of different materials, one a Platinum- 
Rhodium alloy and the other a Titantum-Aluminum-Vanadium alloy. Another pair of 
nested cylinders are made of the same Platinum-Rhodium alloy, in order to account for 
differential accelerations induced by other phenomena such as gravity gradients. The drag- 
free satellite is in a circular, Sun-synchronous orbit at 720 km altitude. In late 2017 the 
MICROSCOPE team reported the result of an analysis of the first year of data, with a 
bound on the order of a part in 10'4 (Touboul et al., 2017). 

Other concepts include tests on orbiting satellites, such as Galileo-Galilei (Nobili et al., 
2012) and STEP (Overduin et al., 2012), and on sounding rockets, such as SR-POEM 
(Reasenberg et al., 2012). 

The recent development of atom interferometry has yielded tests of WEP, albeit 
to modest accuracy, comparable to that of the original E6tv6s experiment. In these 
experiments, one measures the local acceleration of the two separated wavefunctions of 


2] 


2.3 Experimental Tests of the Einstein Equivalence Principle 


—8 ae 
10 Edtvs Atom Interferometry 
Renner 
iv? Free-fall 
| | Fifth-force 
~10 searches 
” Boulder x 
1 Princeton | E6t-Wash 
10— | 
Eot-Wash 
19712 | Lip 
es ra 
40714 
tes MICROSCOPE | 
ais (a,+a,)/2 


79 70. 7% Jo 7o. % % 2 2 2 
GY “9572, “79, %9, 9, 3, 3. 2. 
0% DO “% % % % % 


% 2) 


Year of experiment 


Selected tests of the Weak Equivalence Principle, showing bounds on the Edtvds ratio 77. The light grey region 
represents many experiments originally performed to search for a fifth force, including the “free-fall” and the 
initial Eét-Wash experiments. The dark grey band shows evolving bounds on 77 for gravitating bodies from lunar 
laser ranging (LLR) (see Section 8.1 for discussion). 


an atom such as Cesium by studying the interference pattern when the wavefunctions are 
recombined, and compares that with the acceleration of a nearby macroscopic object of 
different composition (Merlet et al., 2010; Miiller et al., 2010).” Other ongoing experiments 
include dropping cold atom interferometers at the 141 meter drop tower in Bremen, 
Germany, with the ultimate idea of putting such devices in space (Herrmann et al., 2012), 
and testing the free-fall acceleration of anti-Hydrogen (Doser et al., 2012; Perez and 
Sacquin, 2012). The recent discovery of a pulsar in orbit with two white-dwarf companions 
(Ransom et al., 2014) may provide interesting new tests of WEP, because of the strong 
difference in composition between the neutron star and the white dwarfs, as well as precise 
tests of the Nordtvedt effect (see Section 8.1). 

The current upper limits on 7) are summarized in Figure 2.2. 

To determine the limits placed on the individual parameters 7)“ by the best of the current 
experiments, we must estimate the internal energies E“/mc? for the different interactions 
and for different materials. For laboratory-sized bodies, the dominant contribution to E“ 
comes from the atomic nucleus. 


? The claim by Miiller et al. (2010) that these experiments test the gravitational redshift was subsequently shown 
to be incorrect (Wolf et al., 2011). 
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We begin with the strong interactions. The semiempirical mass formula (Rohlf, 1994) 
gives, in MeV, 
(A — 2Z)? }6 
gat RLS aa (2.8) 
where Z and A are the atomic number and mass number, respectively, of the nucleus, and 
where 6 = 1 if (Z, A) = (odd, even), 6 = —1 if (Z, A) = (even, even), and 6 = 0 if A is 
odd. Then 


s ae 6 
Bove 10-2] 13a —150(1 2) 0.71 fe (2.9) 


FS = -15.75A 4+ 17.842 + 23.7 


mc? “43/2 


In the case of electromagnetic interactions, we can distinguish among a number of 
different internal energy contributions, each potentially having its own 7“ parameter. 
For the electrostatic nuclear energy, the semiempirical mass formula yields the estimate 
E®S — 0,711Z(Z— 1)/A'/3 MeV, and thus 


Bs Z(Z-1 
Sa = 75% 10 ee 
Another form of electromagnetic energy is magnetostatic, resulting from the nuclear 
magnetic fields generated by the proton currents. To estimate the nuclear magnetostatic 
energy requires a detailed nuclear shell-model computation. For example, the net proton 
current in any closed angular momentum shell vanishes, hence there is no energy associated 
with the magnetostatic interaction between such a closed shell and any particle outside 
the shell. For aluminum and platinum, the materials used in the Dicke and Braginsky 
experiments, Haugan and Will (1977) showed that (EMS/mc”),,; = 4.1 x 1077 and 

(EMS /mc*)p, = 2.4 x 1077; however no general formula in terms of Z and A is known. 
There is also the hyperfine energy of interaction between the spins of the nucleons and 
the magnetic fields generated by the proton and neutron magnetic moments. Computations 
by Haugan (1978) yielded the estimate E™* /mc* = (27 /V) ux|g7Z° + g; (A — Z)”], where 


(2.10) 


V is the nuclear volume, juy is the nuclear magneton, and g, = 2.79 and g, = —1.91 are 
the gyromagnetic ratios of the proton and neutron, respectively. Then, 
ENF 7? + 9(A —Z)* 
— = 21x 107 |= Sil ) (2.11) 
mc? A2 


For weak interactions, while the parity nonconserving part makes no contribution to the 
nuclear ground-state energy to first order in the weak coupling constant Gy, the parity- 
conserving part does (Haugan and Will, 1976). In the Weinberg-Salam model for weak 
and electromagnetic interactions, the result is 


a _gNZ 
ma = 22% 10 iors [1 + 2(N,Z)] , 
As Z 
g(N,Z) = 0.295 ee + 4sin?6y + a sin? 6 y(2 sin’ Ow — | : (2.12) 


where NV = A — Zis the neutron number, and Oy is the Weinberg angle, with sin’?Oy = 
0.222. 


2.3 Experimental Tests of the Einstein Equivalence Principle 


Table 2.1 Bounds on 77“ parameters from the Edt-Wash experiments. 


Energy type Be-Ti Be-Al 

Strong 4.9 x 107"! 6.5 x 107"! 
Electrostatic 1S x 1O-" DI 3 10 
Hyperfine 2.3 x 1077 0.8 x 1077 
Weak 2.0 x 107? 1.5 x 107 


Gravitational interactions are specifically excluded from WEP and EEP. In Chapter 3, we 
will extend these two principles to incorporate local gravitational effects, thereby defining 
the Gravitational Weak Equivalence Principle (GWEP) and the Strong Equivalence Prin- 
ciple (SEP). These two principles will be useful in classifying alternative metric theories 
of gravity. In any case, for laboratory E6tvés experiments, gravitational interactions are 
totally irrelevant, since for an atomic nucleus 


og Gmpucteus 
A, ee 
mc? 


—39 42/3 
ae ioe, (2.13) 
To test for gravitational effects in GWEP, it will be necessary to employ objects containing 
substantial self-gravity, such as planets or neutron stars (Section 8.1). 
Assuming the 1o bound of |7| < 2 x 10~!? from the latest summary of Eét-Wash exper- 
iments (Wagner et al., 2012), which compared Beryllium with Titanium and Aluminum, 
we show the corresponding bounds on the various 74 parameters in Table 2.1. 


2.3.2. Tests of Local Lorentz Invariance 


In Box 2.1 we reviewed the many experiments that support special relativity. One might 
therefore ask “what is there left to test?” Plenty, as it turns out. In recent years, an extensive 
theoretical and experimental effort has taken place to find or constrain violations of special 
relativity, or in more contemporary language, violations of Local Lorentz Invariance. 
The motivation for this effort is not a desire to repudiate special relativity, but to look 
for evidence of new physics “beyond” special relativity, such as apparent, or “effective” 
violations of Lorentz invariance that might result from certain models of quantum gravity, 
or from the existence of feeble long-range fields in the universe that might couple to matter. 
Some formulations of quantum gravity assert that there is a fundamental length scale given 
by the Planck length, ¢p; = (iG/c?)!/? = 1.6x 10~%> m, but since length is not an invariant 
quantity, then there could be a violation of Lorentz invariance at some level in quantum 
gravity. In brane-world scenarios, while physics may be locally Lorentz invariant in the 
higher dimensional world, the confinement of the interactions of normal physics to our 
four-dimensional “brane” could induce apparent Lorentz-violating effects. And in models 
such as string theory, the presence of additional scalar, vector, and tensor long-range fields 
that couple weakly to matter of the standard model could induce effective violations of 
Lorentz symmetry. These and other ideas have motivated a serious reconsideration of how 
to test Local Lorentz Invariance with better precision and in new ways. 
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In many ways, the model for this new approach to testing Local Lorentz Invariance 
is the classic Hughes-Drever experiment. It was actually two experiments, performed in 
1959-1960 independently by Hughes and collaborators at Yale University and by Drever 
at Glasgow University (Hughes et al., 1960; Drever, 1961). The experiments examined 
the J = 3/2 ground state of the "Li nucleus in an external magnetic field. The state is 
split into four levels by the magnetic field, with equal spacing in the absence of external 
perturbations, so the transition line is a singlet. Any external perturbation associated with 
a preferred direction in space that has a quadrupole (¢ = 2) component will destroy the 
equality of the energy spacing and will split the transition lines. Using NMR techniques, 
the experiments set a limit of 0.04 Hz (1.7 x 107!® eV) on the separation in frequency 
(energy) of the lines. The original interpretation of this result, based on an idea by Cocconi 
and Salpeter (1958) is that it sets a limit on a possible anisotropy in inertial mass induced 
by some tensor field associated with the galaxy. 

Phenomenologically, we can express that anisotropy in the energy of the system in 
the form 

bE ~ AEA, (2.14) 
A 
where E* is the internal energy affected by the interaction, and 5“ is a dimensionless 
parameter that measures the strength of the anisotropy induced by the interaction. Using 
the formulae from Section 2.3.1, we can then make estimates of E“ for ’Li and obtain the 
following limits 


\58| <4 x 107-74, la) <5x 10°-~, 
los|<8x10-", la" |<4x10-*. (2.15) 


A different interpretation treats these experiments as tests of Local Lorentz Invariance. 
If there are vector or tensor fields in the universe that are linked to the cosmological 
distribution of matter, then in the mean rest frame of that matter distribution (as represented 
by the frame in which the cosmic microwave background is isotropic), those fields will take 
on privileged forms, for example (K, 0, 0, 0), for a timelike vector field K“. Because the 
Earth moves at 368km_ s°! relative to that frame, then there will be a preferred spatial 
direction associated with those fields. If any of those fields then interacts with local matter, 
then there could be shifts in the energy levels of atoms and nuclei that depend on the 
orientation of the quantization axis of the state relative to our universal velocity vector, 
and on the quantum numbers of the state, leading to an apparent “anisotropy” of the energy 
or mass of the state. If the induced anisotropies are a quadrupole effect, one would expect 
the net effect to be proportional to the square of the velocity w of the laboratory, if they 
are a dipole effect, one would expect the net effect to be linear in the velocity. If 5} is a 
parameter that measures the “bare” strength of the LLI violation, then one would expect 


Aw Wo. (2.16) 


where ¢ = | or 2, depending on the nature of the anisotropy. 

In order to make this discussion more concrete, we will use a specific theoretical 
framework, called the “c~ formalism,” to be discussed in more detail in Section 2.5.5. 
In this formalism, which is restricted to electromagnetic interactions, one assumes that 
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the net effect of a nonmetric coupling of electrodynamics to some external fields is that 
the effective speed of electromagnetic radiation c is no longer equal to the limiting speed 
co of test particles or of the unbroken Lorentz invariance. In the preferred universal 
rest frame, light propagates at speed c in all directions. Such a Lorentz-non-invariant 
electromagnetic interaction would cause the speed of light to depend on the observer’s 
motion relative to the preferred frame (a modern version of “ether drift”). The altered 
interatomic electromagnetic fields would cause moving rods to shrink in a manner that is 
given by the Lorentz-Fitzgerald factor (1 — v*/c})!/2, using co, not c. The alterations could 
lead to anisotropies in energy levels of atoms and nuclei. In the c? formalism the size of 
the effects induced turns out to be proportional to the velocity w relative to the preferred 
frame to an appropriate power, and to the coefficient 


(d0)¢2 formalism — 1 — —5- (2.17) 


Using this framework, one can analyze a variety of tests of Local Lorentz Invariance 
and place bounds on do; the bounds are summarized in Figure 2.3. The Hughes-Drever 
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Year of experiment 


Selected tests of Local Lorentz Invariance, showing bounds on the parameter do. which measures the degree of 
violation of Lorentz invariance in electromagnetism. The Michelson-Morley, Joos, Brillet-Hall, and cavity 
experiments test the isotropy of the round-trip speed of light. The centrifuge, two-photon absorption (TPA) and 
JPL experiments test the isotropy of light speed using one-way propagation. The most precise experiments test 
isotropy of atomic energy levels, including the Hughes-Drever experiments, and the University of Washington, 
NIST, and Harvard experiments, which exploited cold trapped atoms. The limits assume a speed of Earth of 
368km_ s7' relative to the mean rest frame of the universe. 
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experiments yielded extremely accurate bounds on 69. Dramatic improvements were made 
in the 1980s using laser-cooled trapped atoms and ions (Prestage et al., 1985; Lamoreaux 
et al., 1986; Chupp et al., 1989). This technique made it possible to reduce the broadening 
of resonance lines caused by collisions, leading to improved bounds on 49 shown in 
Figure 2.3 (experiments labelled U. Washington, NIST, and Harvard). 

Also included for comparison are the corresponding limits obtained from Michelson- 
Morley type experiments [for a review, see Haugan and Will (1987)]. In those experiments, 
when viewed from the preferred frame, the speed of light down the two arms of the moving 
interferometer is c, while it can be shown using the electrodynamics of the c” formalism, 
that the compensating Lorentz-FitzGerald contraction of the parallel arm is governed by 
the speed co. Thus the Michelson-Morley experiment and its descendants also measure the 
coefficient Jo. One of these is the Brillet-Hall experiment (Brillet and Hall, 1979), which 
used a Fabry-Perot laser interferometer. A number of experiments addressed the isotropy of 
the one-way speed of light, labelled Centrifuge, TPA and JPL (for a review and references, 
see Will (1992a)). In a recent series of experiments, the frequencies of electromagnetic 
cavity oscillators in various orientations were compared with each other or with atomic 
clocks as a function of the orientation of the laboratory (Lipa et al., 2003; Miiller et al., 
2003; Wolf et al., 2003; Antonini et al., 2005; Stanwix et al., 2005). These placed bounds 
on do at the level of better than a part in 10°. 

The fact that the bounds shown in Figure 2.3 appear to end around 2010 should not 
be taken to mean that tests of LLI have stopped. If anything, they have intensified. 
The simple electrodynamical c? formalism has to a large extent been supplanted by the 
“Standard Model Extension” (SME) developed by Kostelecky and colleagues (Colladay 
and Kostelecky, 1997, 1998; Kostelecky and Mewes, 2002). This framework extends the 
possibility of Lorentz violation (as well as CPT violation) to the entire standard model 
of particle physics. It has a very large number (several hundred) of parameters covering 
everything from electrodynamics to electrons and neutrinos, to hadrons and gluons, 
opening up many new opportunities for experimental tests. We will give a very brief 
introduction to the SME in Section 2.6. For example, the cavity experiments described 
above (Lipa et al., 2003; Miiller et al., 2003; Wolf et al., 2003) placed bounds on the 
coefficients of the tensors &,_ and #1 (see Section 2.6 for definitions) at the levels 
of 10~' and 10~'°, respectively. Direct comparisons between atomic clocks based on 
different nuclear species place bounds on SME parameters in the neutron and proton 
sectors, depending on the nature of the transitions involved. The bounds achieved range 
from 10-77 to 10732 GeV. Recent examples include Wolf et al. (2006) and Smiciklas 
et al. (2011). Several bounds have been placed on the combination of SME parameters 
Rtr — 4coo/3 or on At, alone; depending on the specific model, these are equivalent to 
60/2. The most stringent bounds to date come from collision experiments at the Large 
Electron-Positron Collider (LEP) (Altschul, 2009), and from the absence of certain decay 
channels in the observation of a clean ultrahigh-energy cosmic ray (UHECR) event at the 
Pierre Auger Observatory (Klinkhamer and Schreck, 2008; Klinkhamer and Risse, 2008). 
For other bounds on these parameters and on all the parameters of the SME, from the 
photon sector to the Higgs sector, see Kostelecky and Russell (2011). 

Astrophysical observations have been used to bound Lorentz violations. For example, if 
photons satisfy the Lorentz violating dispersion relation 
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(3) 
Bape + Ep f™ pile+ frp + pre ces (2.18) 
Pl 


where Ep, = (fic? /G)'/? is the Planck energy, and f (”) are dimensionless coefficients, then 
the speed of light v, = 0E/Op would be given, to linear order in the f by 


n—2 
Et es eG yr” ( =) ; (2.19) 


E 
n>1 Fl 


Such a Lorentz-violating dispersion relation could be a relic of quantum gravity, for 
instance. By bounding the difference in arrival time of high-energy photons from a burst 
source at large distances, one could bound contributions to the dispersion for n > 2. One 
limit, | f (3) < 128 comes from observations of 1 and 2 TeV gamma rays from a flare 
associated with the active galaxy Markarian 421 (Biller et al., 1999). Another limit comes 
from birefringence in photon propagation: in many Lorentz violating models, different 
photon polarizations may propagate with different speeds, causing the plane of polarization 
of a wave to rotate. If the frequency dependence of this rotation has a dispersion relation 
similar to Eq. (2.18), then by studying “polarization diffusion” of light from a polarized 
source in a given bandwidth, one can effectively place a bound | f)| < 10~4 (Gleiser and 
Kozameh, 2001). Measurements of the spectrum of ultra-high-energy cosmic rays using 
data from the HiRes and Pierre Auger observatories show no evidence for violations of 
Lorentz invariance (Stecker and Scully, 2009; Bi et al., 2009). 

For thorough surveys of both the theoretical frameworks and the experimental results 
for tests of LLI see the reviews by Mattingly (2005), Liberati (2013) and Kostelecky and 
Russell (2011). The last article gives “data tables” showing experimental bounds on all the 
various parameters of the SME. 

Local Lorentz Invariance can also be violated in gravitational interactions; these will be 
discussed under the rubric of “preferred-frame effects” in Chapter 8. 


2.3.3 Tests of Local Position Invariance 


The two main tests of Local Position Invariance are gravitational redshift experiments, 
which test the existence of spatial dependence on the outcomes of local experiments, and 
measurements of the constancy of the fundamental nongravitational constants, which test 
for temporal dependence. 


Gravitational redshift experiments 


The quintessential gravitational redshift experiment measures the frequency or wavelength 
shift Z = Av/v = —AX/X between two identical frequency standards (clocks) placed at 
rest at different heights in a static gravitational field. To illustrate how such an experiment 
tests Local position Invariance, we will assume that the remaining parts of EEP, namely, 
WEP and Local Lorentz Invariance, are valid. (In Sections 2.4 and 2.5, we will discuss this 
question under somewhat different assumptions.) WEP guarantees that there exist local 
freely falling frames whose acceleration g relative to the static gravitational field is the 
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same as that of test bodies. Local Lorentz Invariance guarantees that in these frames, the 
proper time measured by an atomic clock is related to the Minkowski metric by 


Cdr x —Nyvdxpdxk x dt; — dxp — dyg — dz, (2.20) 


where xp are coordinates attached to the freely falling frame. However, in a local freely 
falling frame that is momentarily at rest with respect to the atomic clock, we permit its 
rate to depend on its location (violation of Local Position Invariance), that is, relative to 
a chosen atomic time standard based on a clock whose fundamental structure is different 
than the one being analyzed, the proper time between ticks is given by 


r=7(U), (2.21) 


where U is a gravitational potential whose gradient is related to the test-body acceleration 
by VU=8 = ~ge.. 

Now the emitter, receiver, and gravitational field are assumed to be static. We define 
a static coordinate system in which the emitter and receiver are at rest in the spatial 
coordinates xs, and in which the time coordinate fg is defined such that the interval of 
time Ats between ticks (passage of wave crests) of the emitter and of the receiver are 
equal. In other words, the time coordinate is defined by counting crests from the emitter 
as they pass a given point and ascribing a chosen “unit” of time to that interval. The 
choice of coordinates is arbitrary, but in this case it is a natural choice because of the 
time translation invariance associated with the static nature of the problem. The static 
coordinates are not the freely falling coordinates of Eq. (2.20), but are accelerated upward 
(in the +z direction) relative to the freely falling frame, with acceleration g. Thus, for 
lgts/c| ~ |gzs/c?| < 1 (ie. for g uniform over the distance between the clocks), a 
sequence of Lorentz transformations yields (MTW, section 6.6) 


ctp = (zg + c*/g) sinh(gts/c), 

zp = (zs + c”/g) cosh(gts/c), 

XF = Xs, 

Yr =ys. (2.22) 
Thus the time measured by the atomic clocks (relative to the standard clock) is given by 


cdr? =1°(U) [c*dtg — dee — dyz — dz] , 


2 
= 7°(U) (1 a 2) Cdk — de — dy — ae (2.23) 


Since the emission and reception rates are the same (1/Av¢s) when measured in static 
coordinate time, and since dvg = dys = dzs = 0 for both clocks, the measured rates 
(v = Ar~') are related by 


7- Vrec — Vem =4 Ec (1 =r a1 


T(Uem) (1 se Bi anj tC) (2.24) 


For small separations Az = Zree — Zem, We can expand 7(U) in the form 


T(Usec) & To — TGAZ + O(Az’) ; (2.25) 


Vrec 
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where 7 = T(Uem) and 7) = dr /dU|em. Then 
Z=(1+a)AU/c’, (2.26) 


where a = —c7} /T| and AU = g-Ax = —g(Zrec—Zem). If there is no location dependence 
in the clock rate, then a = 0, and the redshift is the standard prediction, that is, 


AZ=AU/e’. (2.27) 


Although there were several attempts following the publication of the general theory 
of relativity to measure the gravitational redshift of spectral lines from the Sun and from 
white dwarf stars, the results were inconclusive (see Bertotti et al. (1962) for review). 
During the early years of general relativity, the failure to measure this effect in solar 
lines was seized upon by some as reason to doubt the theory (see Crelinsten (2006) for 
an engaging history of this period). Unfortunately, the measurement is not simple. Solar 
spectral lines are subject to the “limb effect,” a variation of spectral line wavelengths 
between the center of the solar disk and its edge or “limb”; this effect is actually a Doppler 
shift caused by complex convective and turbulent motions in the photosphere and lower 
chromosphere, and is expected to be minimized by observing at the solar limb, where the 
motions are predominantly transverse to the line of sight. Some atoms are also subject to 
line shifts caused by the enormous pressure of the solar atmosphere. The secret is to use 
strong, symmetrical lines, leading to unambiguous wavelength measurements. Successful 
measurements were finally made in the 1960s and 1970s (Brault, 1962; Snider, 1972); 
LoPresto et al. (1991) measured the solar shift in agreement with LPI to about 2 percent 
by observing the oxygen triplet lines both in absorption in the limb and in emission just 
off the limb. Roca Cortés and Pallé (2014) used helioseismology data from 1976 to 2013 
to measure the solar shift to about 5 percent; the error was dominated by effects due to the 
asymmetry of the spectral line used. 

The first successful, high-precision redshift measurement was the series of Pound- 
Rebka-Snider experiments of 1960-1965 (Pound and Rebka, 1960; Pound and Snider, 
1965), which measured the frequency shift of y-ray photons from >’Fe as they ascended 
or descended the Jefferson Physical Laboratory tower at Harvard University. The high 
accuracy of | percent was obtained by making use of the recently discovered Méssbauer 
effect, whereby the recoil momentum of the nucleus from the emitted y-ray was absorbed 
by the entire lattice, in order to produce a narrow resonance line whose shift could be 
accurately determined. 

But the 1970s saw the beginning of a new era in redshift experiments, with the devel- 
opment of atomic frequency standards of ultrahigh stability, reaching parts in 10'°—10'° 
over averaging times of 10-100 seconds and longer. Examples included atomic clocks 
based on Cesium, Rubidium and Hydrogen, and clocks based on stabilized microwave and 
optical cavities. The development of laser cooling of atoms and molecules in the 1980s led 
to even more stable frequency standards, based on cold atoms, Bose-Einstein condensates, 
atom fountains, and atom interferometry. These technological developments led to new 
and more precise tests of LPI. 

The first of these was the famous Hafele-Keating (1972a; 1972b) experiment, which took 
cesium-beam atomic clocks around the world on commercial airline flights, one eastward 
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and one westward, and compared the readings on the traveling clocks upon their return 
with the readings on identical stay-at-home clocks. The differences in elapsed times, which 
included both the gravitational redshift and special relativistic time dilation agreed with 
LPI to about 10 percent. 

The most precise standard redshift test to date was the Vessot-Levine rocket experiment 
known as Gravity Probe A (GPA) that took place in June 1976 (Vessot et al., 1980). 
A hydrogen-maser atomic clock was flown on a rocket to an altitude of about 10,000 km 
and its frequency compared to a hydrogen-maser clock on the ground. The experiment took 
advantage of the masers’ frequency stability (parts in 10'° over 100 s averaging times) by 
monitoring the frequency shift as a function of altitude. A sophisticated data acquisition 
scheme accurately eliminated all effects of the first-order Doppler shift due to the rocket’s 
motion, while tracking data were used to determine the payload’s location and the velocity, 
in order to evaluate the potential difference AU and the special relativistic time dilation 
throughout the flight. Analysis of the data yielded a limit |a| < 2 x 107+. This could 
be improved by an order of magnitude by measuring the redshift between the ground 
and hydrogen-maser clocks on two European global navigation satellites (Galileo 5 and 
6). Because these satellites were mistakenly launched into eccentric orbits instead of the 
intended circular orbits, integrating the modulations of the redshift over a long time will 
improve the statistics (Delva et al., 2015). 

The varying gravitational redshift of Earth-bound clocks relative to the highly stable 
millisecond pulsar B1937+21 was measured to about 10 percent (Taylor, 1987). This effect 
is caused by the Earth’s monthly oscillation with an amplitude of about 4000 km in the solar 
gravitational field resulting from its motion around the Earth-Moon center of mass. Two 
measurements of the redshift using stable oscillator clocks on spacecraft were made at the 
1 percent level: one used the Voyager spacecraft in Saturn’s gravitational field (Krisher 
et al., 1990a), while another used the Galileo spacecraft in the Sun’s field (Krisher et al., 
1993). 

Advances in stable clocks also made possible a new type of redshift experiment that is 
a direct test of Local Position Invariance (LPI): a “null” gravitational redshift experiment 
that compares two different types of clocks, side by side, in the same laboratory. If LPI is 
violated, then not only can the proper ticking rate of an atomic clock vary with position, 
but the variation must depend on the structure and composition of the clock, otherwise all 
clocks would vary with position in a universal way and there would be no operational way 
to detect the effect (since one clock must be selected as a standard and ratios taken relative 
to that clock). Thus, we must write, for a given clock type A, 


THU) =f (1-04 U/e’) . (2.28) 


Then a comparison of two different clocks at the same location would measure 


A A 
== (5) [ (a4 — a®) sl. (2.29) 
0 


where (74/7?)o is the constant ratio between the two clock times observed at a chosen 
initial location. 
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The first null redshift experiment of this type was performed in April 1978 at Stanford 
University. The rates of two hydrogen maser clocks and of an ensemble of three 
superconducting-cavity stabilized oscillator (SCSO) clocks were compared over a 10-day 
period (Turneaure et al., 1983). During this time, the solar potential U in the laboratory 
changed sinusoidally with a 24-hour period by 3 x 10~!° because of the Earth’s rotation, 
and changed linearly at 3 x 107! per day because the Earth is 90° from its perihelion in 
April. Analysis of the data set an upper limit on both effects, leading to a limit on the LPI 
violation parameter 


la Sah?) .< 10-2; (2.30) 


Over the following years, this bound was improved using more stable frequency standards, 
such as atomic fountain clocks (Godone et al., 1995; Prestage et al., 1995; Bauch and 
Weyers, 2002; Blatt et al., 2008). The best current bounds, from comparing a *’Rubidium 
atomic fountain with either a '*?Cesium fountain or a hydrogen maser (Guéna et al., 2012; 
Peil et al., 2013), and from comparing transitions of two different isotopes of Dysprosium 
(Leefer et al., 2013), hover around the one part per million mark. 

Figure 2.4 summarizes the bounds on the parameter a from standard redshift exper- 
iments or on the difference a4 — ag for different types of clocks from null redshift 
experiments. 

The Atomic Clock Ensemble in Space (ACES) project will place both a cold 
trapped atom clock based on Cesium called PHARAO (Projet d’Horloge Atomique par 
Refroidissement d’Atomes en Orbite), and an advanced hydrogen maser clock on the 
International Space Station to measure the gravitational redshift to parts in 10°, as well as 
to carry out a number of fundamental physics experiments and to enable improvements in 
global timekeeping (Reynaud et al., 2009). Launch is currently scheduled for late 2018 or 
early 2019. 

Modern advances in navigation using Earth-orbiting atomic clocks and accurate time- 
transfer must routinely take gravitational redshift and time-dilation effects into account. For 
example, the US Global Positioning System (GPS) provides absolute positional accuracies 
of around 15 m (even better in its military mode), and 50 nanoseconds in time transfer 
accuracy, anywhere on Earth. Yet the difference in rate between satellite and ground clocks 
as a result of relativistic effects is a whopping 39 microseconds per day (46 jus from the 
gravitational redshift, and —7 vs from time dilation). If these effects were not accurately 
accounted for, GPS would fail to function at its stated accuracy. This represents a welcome 
practical application of general relativity! For the role of general relativity in GPS, see 
Ashby (2002, 2003); for a popular essay, see Will (2000). 

A final example of the almost “everyday” implications of the gravitational redshift is 
a remarkable measurement using optical clocks based on trapped aluminum ions of the 
frequency shift over a height of 1/3 of a meter (Chou et al., 2010). 


Variations of the fundamental constants 


Local Position Invariance also refers to position in time. If LPI is satisfied, the fundamental 
constants of nongravitational physics should be constants in time. If LPI is not satisfied, 
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Selected tests of Local Position Invariance via gravitational redshift experiments, showing bounds on cy, which 
measures the degree of deviation of redshift from the formula Av /1 = AU/c?. In null redshift experiments, 
the bound is on the difference in c between different kinds of clocks. 


then some constants could vary with time as one or more external fields that couple to 
matter in a nonmetric way evolve with time as the universe expands. 

Early ideas about the possibility of time variations of the constants originated with 
Dirac, and were based on the so-called “large numbers coincidence,” in which a certain 
dimensionless combination of fundamental constants produced the very large number 
10*°, while another combination of constants together with the age of the universe 
produced the same large number. Was this merely a remarkable coincidence related to 
the specific conditions of the present epoch (such as the existence of physicists to make 
such speculations), or was it a hint of a fundamental law that would therefore require 
at least one constant to vary along with the age of the universe in order to maintain 
the equality between the large numbers? Dyson (1972) gives a detailed review of these 
early ideas. For a comprehensive recent reviews both of experiments and of theoretical 
ideas that underlie proposals for varying constants, see Uzan (2011) and Martins (2017). 
Here we will focus on recent observational bounds on cosmological variations of selected 
dimensionless combinations of the nongravitational constants (we will discuss variations 
of the Newtonian gravitational constant in Section 8.5). 
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The bounds on varying constants come in two types: bounds on the present rate of 
variation, and bounds on the difference between today’s value and a value in the distant 
past. The main example of the former type is the clock comparison test, in which highly 
stable atomic clocks of different fundamental type are intercompared over periods ranging 
from months to years (variants of the null redshift experiment). If the frequencies of the 
clocks depend differently on the electromagnetic fine structure constant agy, the weak 
interaction constant aw, the electron-proton mass ratio m./m, or the proton gyromagnetic 
ration g,, for example, then a limit on a drift of the ratio of the frequencies translates into a 
limit on a drift of the constant(s). The dependence of the frequencies on the constants may 
be quite complex, depending on the atomic species involved. Recent tests have exploited 
the techniques of laser cooling and trapping, and of atom fountains, in order to achieve 
extremely high long-term clock stability. Specific experiments compared the ground-state 
hyperfine transition in '3*Cesium to the *’Rubidium hyperfine transition (Marion et al., 
2003), the '’Mercury ion electric quadrupole transition (Bize et al., 2003), the atomic 
Hydrogen 1S-2S transition (Fischer et al., 2004), and an optical transition in '7! Ytterbium- 
(Peik et al., 2004). More recent experiments have used *’ Strontium atoms trapped in optical 
lattices compared with Cesium (Blatt et al., 2008) to obtain dgm/arm < 6 x 107!% yr7!, 
compared ®’Rubidium and !*?Cesium fountains (Guéna et al., 2012) to obtain dgm/aEm < 
2.3 x 10~!° yr—!, or compared two isotopes of Dysprosium (Leefer et al., 2013) to obtain 
Geo /QEM < 13x 10~!6 yr. 

The second type of bound involves measuring the relics of or signal from a process 
that occurred in the distant past and comparing the inferred value of the constant with the 
value measured in the laboratory today. One subtype uses astronomical measurements of 
spectral lines at large redshift, while the other uses fossils of nuclear processes on Earth to 
infer values of constants early in geological history. 

Earlier comparisons of spectral lines of different atoms or transitions in distant galaxies 
and quasars produced bounds on variations of apm or g,(m_/m,) on the order of 1 part 
in 10 per Hubble time (Wolfe et al., 1976). Dramatic improvements in the precision of 
astronomical and laboratory spectroscopy, in the ability to model the complex astronomical 
environments where emission and absorption lines are produced, and in the ability to 
reach large redshift have made it possible to improve the bounds significantly. In fact, 
in 1999, Webb and collaborators (Webb et al., 1999; Murphy et al., 2001) announced 
that measurements of absorption lines in Magnesium, Aluminum, Silicon, Chromium, 
Iron, Nickel and Zinc in quasars in the redshift range 0.5 < Z < 3.5 indicated a 
smaller value of vem in earlier epochs, namely Aa gm/agm = (—0.72 + 0.18) x 107°, 
corresponding to @gm/Qgm = (6.4 + 1.4) x 107!% yr! (assuming a linear drift with 
time). The Webb group continues to report data supporting changes in agm over large 
redshifts (King et al., 2012; Bainbridge and Webb, 2017). Measurements by other groups 
have so far failed to confirm this nonzero effect (Quast et al., 2004; Chand et al., 2005); 
an analysis of Magnesium absorption systems in quasars at 0.4 < Z < 2.3 gave 
Gem/Qem = (—0.6 + 0.6) x 107!°yr—! (Srianand et al., 2004). More recent studies 
have also yielded no evidence for a variation in @gy (Kanekar et al., 2012; Lentati et al., 
2013). 
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Table 2.2 Bounds on cosmological variations of nongravitational constants. 


Constant k Limit on k/ k (yr!) Redshift Method 
Fine structure KIB x 1O-™ 0 Clock comparisons 
constant <0.5 x 107 !° 0.15 Oklo natural reactor 
(aem = e?/hc) Bal se 7 0.45 '87Re decay 

EID 2 10-™ 0.4-2.3 Spectra in distant quasars 
Weak interaction il x 10" 0.15 Oklo natural reactor 
constant <5 x 107” 10° Big Bang nucleosynthesis 
(aw = Gamzc/h’) 
e-p mass ratio <3.3 x 107 0 Clock comparisons 

<3 x 107° 2.6—3.0 Spectra in distant quasars 


Another important set of bounds arises from studies of the “Oklo” phenomenon, a group 
of natural, sustained 7*>Uranium fission reactors that occurred in the Oklo region of Gabon, 
Africa, around 1.8 billion years ago. Measurements of ore samples yielded an abnormally 
low value for the ratio of two isotopes of Samarium, '*?Sm/'4’Sm. Neither of these isotopes 
is a fission product, but '4°Sm can be depleted by a flux of neutrons. Estimates of the 
neutron fluence (integrated dose) during the reactors’ “on” phase, combined with the 
measured abundance anomaly, yield a value for the neutron cross-section for '4?Sm 1.8 
billion years ago that agrees with the modern value. However, the capture cross-section is 
extremely sensitive to the energy of a low-lying level (EF ~ 0.1 eV), so that a variation in 
the energy of this level of only 20 meV over a billion years would change the capture cross- 
section from its present value by more than the observed amount. This was first analyzed 
by Shlyakter (1976). Recent reanalyses of the Oklo data (Damour and Dyson, 1996; Fujii, 
2004; Petrov et al., 2006) lead to a bound on gy at the level of around 5 x 107!” yr7!. 

In a similar manner, reanalyses of decay rates of '*7Rhenium in ancient meteorites 
(4.5 billion years old) gave the bound |dgy/agm| < 3.4 x 107!%yr~! (Olive et al., 
2004). Finally, bounds on any variation of the weak interaction coupling constant 
aw have been set by comparing predicted abundances of the light elements produced 
during Big-Bang nucleosynthesis and the observationally inferred primordial abundances 
(Malaney and Mathews, 1993; Reeves, 1994). The current best bounds are summarized in 
Table 2.2. 


2.4 Schiff’s Conjecture 


Because the three parts of the Einstein Equivalence Principle discussed above are so very 
different in their empirical consequences, it is tempting to regard them as independent 
theoretical principles. However, any complete and self-consistent gravitation theory must 
possess sufficient mathematical machinery to make predictions for the outcomes of 
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experiments that test each principle, and because there are limits to the number of ways 
that gravitation can be meshed with the standard model of particles and their interactions, 
one might not be surprised if there were theoretical connections between the three 
subprinciples. For instance, the same mathematical formalism that produces equations 
describing the free fall of a hydrogen atom in a gravitational field must also produce 
equations that determine the energy levels of hydrogen in that gravitational field, and 
thereby determine the ticking rate of a hydrogen maser clock. Hence a violation of EEP 
in the fundamental machinery of a theory that manifests itself as a violation of WEP 
might also be expected to lead to a violation of Local Position Invariance. Around 1960, 
Leonard Schiff conjectured that this kind of connection was a necessary feature of any 
self-consistent theory of gravity. 

More precisely, Schiff’s conjecture stated that any complete, self-consistent theory of 
gravity that embodies WEP necessarily embodies EEP. In other words, the validity of WEP 
alone guarantees the validity of Local Lorentz and Position Invariance, and thereby of 
EEP. This form of Schiff’s conjecture is an embellished classical version of his original 
1960 quantum mechanical conjecture (Schiff, 1960). His interest in this conjecture was 
rekindled in November 1970 by a vigorous argument with Kip Thorne at a conference on 
experimental gravitation held at the California Institute of Technology. Unfortunately, his 
untimely death in January, 1971 cut short his renewed effort. 

Schiff’s conjecture, taken literally, implies that the E6tvos experiments are the direct 
empirical foundation for EEP, and for the interpretation of gravity as a curved-spacetime 
phenomenon, and that gravitational redshift experiments, for example, are weak tests of 
gravitation theory and thus unnecessary. For these reasons, much effort went into “proving” 
Schiff’s conjecture. Of course, a rigorous proof of such a conjecture is impossible, and in 
fact some counterexamples are known. And even if Schiff was right, this has not stopped 
people from carrying out gravitational redshift experiments or tests of Local Lorentz 
Invariance, motivated at least in part by the notion that any confirmed violation of these 
two principles could be the signal of entirely new physics. 

Nevertheless, Schiff’s conjecture provides a very useful platform for viewing the deep 
connections among phenomena described by the parts of EEP, and therefore we will devote 
some time to analyzing a range of arguments that support the conjecture. 

We first restrict attention to theories of gravity based on a Lagrangian. We consider 
an idealized composite body made up of structureless test particles that interact by some 
nongravitational force to form a bound system and focus on the Hamiltonian governing the 
system. For a body that moves sufficiently slowly in a weak, static gravitational field, the 
laws governing its motion can be put into a quasi-Newtonian form (we assume the theory 
has a Newtonian limit). In particular, the conserved energy of the system, as obtained from 
the Hamiltonian will be assumed to be of the form 


1 
Ec = Mpc? + 5 Mal” — MgU(X) + c~?O(MRU*, MV", MpVU), (2.31) 


where X and V are the quasi-Newtonian location and velocity of the center of mass of the 
body, Mr is the “rest” mass of the body, U is the external gravitational potential, and c is 
a fundamental speed used to convert units of mass into units of energy. If EEP is violated, 
we must allow for the possibility that the speed of light and the limiting speed of material 
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particles may differ in the presence of gravity; to maintain this possibility we do not set 
c = | automatically in Eq. (2.31) (see also Box 2.2). Note that V is the velocity relative 
to some preferred frame. In problems involving external, static gravitational potentials, 
the preferred frame is generally the rest frame of the external potential, while in problems 
involving cosmological gravitational effects where localized potentials can be ignored, the 
preferred frame is that of the universe rest frame. [In problems involving both kinds of 
effects, the simple form of Eq. (2.31) no longer holds.] 

The possible occurrence of EEP violations arises when we write the rest mass Mp in 
the form 


Mp = Mo —c 7 Ep(X, V), (2.32) 


where Mp is the sum of the rest masses of the structureless constituent particles, and Eg 
is the binding energy of the body. It is the position and velocity dependence of Ep, a 
dependence that in general is a function of the structure of the system, which signals the 
breakdown of EEP. Roughly speaking, an observer in a freely falling frame can monitor 
the binding energy of the system, thereby detecting the effects of his location and velocity 
in local nongravitational experiments. Haugan (1979) made this more precise by showing 
that in fact it is the possible functional difference in Eg(X, V) between the system under 
study and a “standard” system arbitrarily chosen as the basis for the units of measurement 
that leads to measurable effects. Because the location and velocity dependence in Fx is a 
result of the external gravitational environment, it is useful to expand it in powers of U and 
V*. To an order consistent with the quasi-Newtonian approximation in Eq. (2.31), we write 


Ep(X, V) = ES + 6miU*(x) — simi ie OL BRO wicca )s (2.33) 


where U/" is the external gravitational potential tensor [see Eq. (4.33)]; it is of the same 
order as U and satisfies U7 = U. Summation over repeated spatial indices j and k is 
assumed. The quantities 6m} and dm} are called the anomalous passive gravitational 
and inertial mass tensors, respectively. They are expected to be of order 7E%, where 
7 is a dimensionless parameter that characterizes the strength of EEP-violating effects; 
they depend upon the detailed internal structure of the composite body. We now assume 
that, associated with the Hamiltonian that governs the composite system in the external 
environment is a conjugate momentum and a conserved energy of the form 


PJ = (My — c~7E8)Vi + 6m, 


1 o Fs ee 

Ec = Moc? — E+ 5 [ (1% — co 7 BA) 6 + Sm | vive 

= [(M% — 7289) 5k 4 bmp | ui (2.34) 

We first show that this yields violations of Local Position and Lorentz Invariance. We 

imagine that our test system makes a transition from one internal state to another, emitting 
a photon of frequency v. The change in the rest energy of the test system is 


AMgc” = [(Mr)i — (Mr)? = —AER = hry, (2.35) 


where / is Planck’s constant. Conservation of energy and momentum, including the emitted 
photon implies that 
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hv = AEc 7 
clhve, = AP, (2.36) 
where e, is a unit vector in the direction of the emitted photon. Combining these 
conservation laws, and including the changes AV, Adm}, and Adm; * between the initial 


and final state, and working to first order in the small changes, we find that the emitted 
frequency of the photon is “er by 


y= 7 '- = aa (O* + af") vivt- 5 = (6* + af’) uN , (237) 
YY 
where 
ait = A(dmf")c? 
aes hi : 
. ING) Jky 2 
aj = ae (2.38) 


and V is the final velocity of the test system. The factor in the denominator in Eq. (2.37) is 
the usual kinematical first-order Doppler effect. 

As it stands, however, the result of Eq. (2.37) is not very informative, because we have 
no clear way to measure the frequency 1; even far from the Earth, we cannot be certain that 
there is not some residual potential U/* due to the galaxy or the universe, or any residual 
motion relative to some preferred frame. However, what can be measured unambiguously 
is the ratio of the frequency of the photon to that of a photon emitted by test system of a 
different internal composition, that is adjacent to the first system and at rest with respect to 
it. This system will be chosen to be the “standard” of frequency. The ratio between the two 
frequencies would then be given by 


~ oe {! ; [(ai!). = (a("),| ia [(a M2 — (a ‘aril un . (2.39) 


If EEP is valid, so that at = =a = 0 for all systems, then the result is the universal value 
of the frequency of the transition of system 2 in units of the standard frequency. If the 
anomalous passive gravitational mass tensor dm} is nonzero, it can produce preferred- 
location effects in a null gravitational redshift experiment. If the anomalous inertial 
mass tensor dm" is nonzero, it can produce preferred-frame effects in a Hughes-Drever 
experiment. 

A different argument, based only upon the assumption of energy conservation, can be 
used to demonstrate the link between WEP and LPI and LLI. This assumption allows one 
to perform very simple cyclic gedanken experiments in which the energy at the end of the 
cycle must equal that at the beginning of the cycle. This approach was pioneered by Dicke 
(1964), and subsequently generalized by Nordtvedt (1975) and Haugan (1979). 

We begin with a set of n free particles of mass mo at rest at X = h. From Eq. (2.34), the 
conserved energy is simply nmoc?[1 — U(h)/c?]. We then form a composite body out of 
these particles and convert the released energy to free particles of rest mass mo, stored in a 
massless reservoir. The conserved energy of the composite body is 


Ecomp(h) = Mpc? — [Meo + mf] Uh), (2.40) 
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where Mp = nmg — c~7Ey, and that of the reservoir is 
U(h key sik 
Exes(h) = E®, (1 - 2) + dm U"*(h). (2.41) 
c 

The composite body falls freely to X = 0 with an acceleration assumed to be A, while 
the stored test particles fall with acceleration g = WU (by definition), reaching X = 0 
with velocities (—2A - h)!/? and (—2g - h)!/?, respectively (we assume that g, V and h are 
parallel). At X = 0, the energies of the composite particle and the test particles are 


Ecomp(0) = Mpc? — [Mino ms bm | uik(Q) — [ano i Sm" | Ain, (2.42a) 
Exes(0) = ER ! os | + dmpUM(h)[1 + OCU/e?)], (2.42b) 
Cc é 


The particle is brought to rest and the acquired kinetic energy —[Mpd/* + dm) "\A/ h is 
stored in the reservoir. We now extract from the reservoir enough energy E},[1— U(0)/c?|+ 
om) UF*(0) to disassemble the composite particle into its n constituent test masses, plus 
enough energy —nmog -h to return them to X = A, arriving at rest with the same conserved 
energy nmgc?(1 — U(h)/c?] as they had before. The reservoir now contains 


— ae Eo a . 
Eres(0) = —[Mpd"* + 5mj* ASHE + amo = 3 g-h+ dm [u* (a) — U*(0)] 
S [Mele — A’) — bial + Simp U0), hi. (2.43) 


If energy is to be conserved, the reservoir must be empty, and thus, working to first order 
in the small difference A — g, we obtain 


Ai = gi — Bie! + put, (2.44) 
where 
ik 
jk _ OMp 
as 
6m 
oF ae (2.45) 
R 


where My = My — c-* Ey. Again, we observe the deep link between violations of LPI and 
LLI and violations of WEP. If WEP is satisfied (Sm a dm} * — 0 for all systems), then so 
are LPI and LLI. 

A variant of this cyclic gedanken experiment was devised by Nordtvedt (1975) in order 
to derive the gravitational redshift between systems separated by a height h. We begin as 
before with n free particles of mass mo at rest at X = h (for simplicity we will assume 
that the overall velocity relative to a preferred frame vanishes). We again form a composite 
body with conserved energy given by Eq. (2.40). This time we convert the released energy 
into a photon, which propagates downward to X = 0. Its energy is compared to that of 
a photon emitted from an identical composite system residing at X = 0. We will assume 
that the energy of the arriving photon is given by (1 — Z)[E¥(1 — U(0)/c”) + dm} UF*(0)], 
where Z is the “redshift,” to be determined. This energy is added to the reservoir along 
with the kinetic energy —[Mpd/* + 6m/"|A/h* acquired by the falling composite body, less 
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an amount of energy required to disassemble the body and return it to X = h. Substituting 
Eq. (2.44) for A’, we find that the reservoir contains 


Eres(0) = —Z {8 [ om | + smgsu™(a)} + EXg-h — dm}U(0) jh’. (2.46) 


The reservoir must be empty, and so solving for Z to first order in g - 4 ~ U we obtain 


1 5m} | 
Z=—5|AU poo |, (2.47) 
c E, 
where AU = g-h, and AU# = VU - h. 

By similar analyses, one can derive the effect of LLI violations on frequency shifts 
between moving emitters and receivers undergoing general motions or orbits (Haugan, 
1979; Wolf and Blanchet, 2016). In particular the work of Wolf and Blanchet (2016) 
yielded new insight into the so-called “noon-midnight” redshift. It has long been known 
that in any metric theory of gravity, there is no frequency shift due to the Sun’s gravitational 
field between two clocks on opposite sides of the Earth. Since the Earth is in free fall around 
the Sun, the only effect of the Sun’s field is a tidal effect, leading to a quadrupolar noon—6 
pm shift, but with an amplitude far too small to be detectable with current technology. But if 
EEP is violated, then the anomalous mass tensors 5m) and 5m" can contribute to a noon- 
midnight effect when different clocks are compared, and that can be tested to interesting 
levels, such as in an orbiting clock experiment called STE-QUEST, that is currently under 
study (Altschul et al., 2015). 

Thorne, Lee, and Lightman (1973) proposed a more qualitative “proof” of Schiff’s 
conjecture for that class of gravitation theories that are based on an invariant action 
principle. They begin by defining the concept of “universal coupling”: a generally covariant 
Lagrangian-based theory is universally coupled if it can be put into a mathematical form 
(representation) in which the action for matter and nongravitational fields [yg contains 
precisely one gravitational field: a symmetric, second-rank tensor 7 with signature +2 that 
reduces to 7 when gravity is turned off; and when w is replaced by 7, Ing becomes the 
action of special relativity. Clearly, among all Lagrangian-based theories, one is universally 
coupled if and only if it is a metric theory. 

Let us illustrate this point with a simple example. Consider a Lagrangian-based theory 
of gravity that possesses a globally flat background metric 7 and a symmetric, second-rank 
tensor gravitational field 4. The nongravitational action for charged point particles of rest 
mass m, and charge e, and for electromagnetic fields has the form 


ING = Ip =F Tint + Lom > (2.48) 
where 


Ib = —mo i dtr, dr= —(Nuv + Np )ax" dx” , 


Live = @ f Apdts" > 


1 1 
== Ma _ pyro (bv _ pbv\R RF a(1+— 
cag | On meV — HF Foo (14 sh)ate, 49) 
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where F’,,, = Ay, — Ay,v, and where 


| = [reel AY = On’ Phog, hen hy, (2.50) 


We work in a coordinate system in which 7 = diag(—1, 1, 1, 1). To see whether this 
theory is universally coupled, the obvious step is to assume that the single gravitational 
field ~,,, is given by 


Vv = Nv + hy. (2.51) 


This would make /p and Jin, appear to be universally coupled. However, in the electromag- 
netic Lagrangian, we obtain, for example, 


he — Abe = yho — nHPne + O(n), (2.52) 


where ||7)"|| = ||%pa||~!. Thus, there is no way to combine 7,,, and h,,, into a single 
gravitational field in Iyc, hence the theory is not universally coupled. To see that the theory 
is also not a metric theory, we transform to a frame in which at an event P, 


Wao(P) =nao Wava(P) =0. (2.53) 


Note that in this frame, Aao(P) A 0 in general, thus the action can be put into the form 


Inc = [srt + Al, (2.54) 
where 
p 1 ily 
Ispp = ing f dre f Apds” = ae | tPF oF gles, 
TT 
1! né,&,,p0 1 fj 76 ie. 30 Bs = 
AI = br 2a" anh + 8 (i? = 2n%h, 5) nt n? | FooFeal—a es. 


(2.55) 


So in a local Lorentz frame, the laws of physics are not those of special relativity, so the 
theory is not a metric theory. Notice that in this particular case, for weak gravitational fields 
(\hyv| < 1), the theory is metric to first order in /,,,, while the deviations from metric form 
occur at second order in h,,,,. In the next section, we will present a mathematical framework 
for examining a class of theories with nonuniversal coupling and for making quantitative 
computations of its empirical consequences. 

Consider now all Lagrangian-based theories of gravity, and assume that WEP is valid. 
WEP forces /yg to involve one and only one gravitational field (which must be a second- 
rank tensor ¢ which reduces to 7 far from gravitating matter). If yg were to involve 
some other gravitational fields ¢, K,, Ayv,... they would all have to conspire to produce 
exactly the same acceleration for a body made largely of electromagnetic energy as for one 
made largely of nuclear energy, and so on. This is unlikely unless 7%,,,,, and the other fields 
appear everywhere in Jvg in precisely the same form, for example, f(¢)y,,, ifa scalar field 
is present, w,,, + ah, if a tensor field is present, and so on. In this case, one can absorb 
these fields into a new field g,,,, and end up with only one gravitational field in yg. This 
means that the theory must be universally coupled, and therefore a metric theory, and must 
satisfy EEP. 
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One possible counterexample to Schiff’s conjecture was proposed by Ni (1977): a 
pseudoscalar field ¢ that couples to electromagnetism in a Lagrangian term of the form 
hehoB F pvtog, Where e#¥°8 is the completely antisymmetric Levi-Civita symbol. Ni 
argued that such a term, while violating EEP, does not violate WEP in the sense of 
composition dependent accelerations, although it does have the observable effect of 
producing anomalous torques on systems of electromagnetically bound charged particles. 


2.5 The THe ps Formalism 


The discussion of Schiff’s conjecture presented in the previous section was very general, 
and perhaps gives compelling evidence for the validity of the conjecture. However, because 
of the generality of those arguments, there was little quantitative information. For example, 
no means was presented to compute explicitly the anomalous mass tensors dm} and 5m} 
for various systems. In order to make these ideas more concrete, we need a model theory 
of the nongravitational laws of physics in the presence of gravity that incorporates the 
possibility of both metric and nonmetric (nonuniversal) coupling. This theory should be 
simple, yet capable of making quantitative predictions for the outcomes of experiments. 
One such “model” theory is the THe, formalism, devised by Lightman and Lee (1973). It 
restricts attention to the motions and electromagnetic interactions of charged structureless 
test particles in an external, static, and spherically symmetric (SSS) gravitational field. 
It assumes that the nongravitational laws of physics can be derived from an action /NG 
given by 


ie = =F mo0 f (P= H2)"Par+ Seo f Ay lve 


es (cE* — p~'B’) d*x, (2.56) 
87 
(here we use units in which x and ¢ both have units of length) where mog, €q, and x!; (f) are 
the rest mass, charge, and world line of particle a; x° = t, vi = dx! /dt, E = VAo — Ao, 
B =V x A, and where scalar products between 3-vectors are taken with respect to the 
Cartesian metric 6’. The functions 7, H, ¢ and j: are assumed to be functions of a single 
external gravitational potential ©, but are otherwise arbitrary. For an SSS field in a given 
theory, 7, H, €, and jz will be particular functions of ®. 
For SSS fields, Eq. (2.56) is general enough to encompass all metric theories of 
gravitation. In a metric theory, the action /Ng is given by 


Ing = — mon f (sativa de-+ Dee f Aust veat 


~~ TF gi BP Fap(—g)/7d'x, (2.57) 

loa 
where g is the determinant of g,,,, and F,,, = Ay, — Ay, - For a static spherically 
symmetric spacetime in isotropic coordinates, g99 = —T, go, = 0, and gy = Hodjx. It is 


then straightforward to show that the action in a metric theory takes the form 
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hus = Yomog f (P= v2)" + Yee f Aylae ve 


1/2 1/2 
ref |@) EP (=) a (2.58) 


Thus any metric theory can be cast in the THej: form but with € and ju given by 


1/2 
€="U= () : (2.59) 


It also encompasses a wide class of nonmetric theories, such as the Belinfante Swihart 
(1957a, 1957b, 1957c) theory and the nonmetric theory discussed in Section 2.4. 


2.5.1 Einstein Equivalence Principle in the THe jz formalism 


We begin by exploring in some detail the properties of the formalism as presented in 
Eq. (2.56). Later, we will apply it to the interpretation of experiments. 

In order to examine the Einstein Equivalence Principle in this formalism we must work 
in a local freely falling frame. But we do not yet know whether WEP is satisfied by the 
THe, theory (and we suspect that it is not, in general), so we do not know to which freely 
falling trajectories local frames should be attached. We must therefore arbitrarily choose a 
set of trajectories: the most natural choice is the set of trajectories of neutral test particles, 
that is, particles governed only by the first term in /yg, since their trajectories are universal 
and independent of the mass m,. We make a transformation to a coordinate system xe = 
(t, x), chosen according to the following criteria: (i) the origins of both coordinate systems 
coincide, that is, for a selected event P, x°(P) = x*(P) = 0, (ii) at P, a neutral test 
body has zero acceleration in the new coordinates, that is, d*x//di|p = 0, and in the 
neighborhood of P the deviations from zero acceleration are quadratic in the quantities 
Ax® = x* —x*(P), and (iii) the motion of the neutral test body is derivable from an action 
Jy. The required transformation, correct to first order in the quantities gp/ and go - X, both 
assumed small, is 


i= Ts ity x 
a oT," ; 
P 1/2 T) Ap 2 
=H, [e+ gteoe + TE (Orgy -x—ex)), (2.60) 


where the subscript (0) and superscript (’) on functions such as T, H, ¢, or denote 
To = T(x® =x*=0), %%=OT/O8|,0—,2=0, (2.61) 


and where gy = V®. The action Jyg in the new coordinates then has the form 


Inc = = Yom f =i) a+ Dee f Aviat 


t.. PRA Fle iS ay . 
Baer a #P)1i+-=(—*, |Tox- 
T g 760 (2) i T 5 Todt”? 080° * 
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Hy a5 1 T% ‘ 
B\l A “Xx 
To€o [40 b) (=, 080 


Ty Ho > eee 4e . A\2 
1 YTotgy- (é x B) d*x + [corrections O(x*)*] , (2.62) 
i €o}0 
where », = dx,/di, and 
_ Ox i) “ ee x 
The quantities (9, Ag, and Yo are defined by 
ane. % 
Tp= 2.64 
ae (2 "Oy Oy) on 
2T, ‘ T i 
ia (ee | (2.64b) 
T \Ho 27) 2H 
Toe 
To =1 - (2.64c) 


Note that our choice of the multiplicative factors 7, 5 * and Hh! * in the transformation 
equations (2.60) resulted in unit coefficients in the point-particle part of the action, making 
it look exactly like that of special relativity. 

Let us now examine the consequences for EEP of the physics governed by /yg. We 
first focus on the form of JNg at the event P (xd =f= 0), since local test experiments 
are assumed to take place in vanishingly small regions surrounding ?. Because such 
experiments are designed to be electrically neutral overall, we can assume that the E and 
B fields do not extend outside this region. Then at P, 


Ing =— Yims f = 0) df + Se f Anvil 


1 TA T Leg By Vas a. 
“aee(aa) {LP (ame) he ne 


We first see that, in general, Jyg violates Local Lorentz Invariance. A simple Lorentz 
transformation of particle coordinates and fields in yg shows that Ng is a Lorentz invariant 
if and only if 

A 


A 
To €0 /40 ae: 0 ~ Ge 288) 


Since we have not specified the event P, this condition must hold throughout the SSS 
spacetime. Notice that the quantity (7) "Hoey : Lo ')1/2 plays the role of the speed of light 
in the local frame, or more precisely, of the ratio of the speed of light cijgnt to the limiting 


speed co of neutral test particles, that is, 


A = oi (2.67) 
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Our coordinate transformation was chosen in such away that, in the local freely falling 
frame, co = 1; equivalently, in the original THe: coordinate system [see Eq. (2.56)] 


co = (To/Ho)'/”, —ciight = (opto) /?. (2.68) 


These speeds will be the same only if Eq. (2.66) is satisfied. If not, then the rest frame of 
the SSS field is a preferred frame in which /yg takes its THej. form, and one can expect 
observable effects in experiments that move relative to this frame. Thus, the quantity To 
plays the role of a preferred-frame parameter: if it is zero everywhere, the formalism is 
locally Lorentz invariant; if it is nonzero anywhere, there will be preferred-frame effects 
there. As we will see, the Hughes-Drever experiments and their descendents provide the 
most stringent limits on this preferred-frame parameter. 

Next, we observe that the action in Eq. (2.62) is locally position invariant if and only if 


T,\ 1/2 
€0 ( 2) = |constant, independent of P], 
0 
1/2 
Ti 
Lo (2) = |constant, independent of P]. (2.69) 
0 


Even if the theory is locally Lorentz invariant (Yo = 0, independent of P), there may 
still be location-dependent effects if the quantities in Eq. (2.69) are not constant. This 
would correspond, for example, to the situation discussed in Section 2.2, in which different 
parts of the local physical laws in a freely falling frame couple to different multiples 
on, 2, ... of the Minkowski metric; in this case, free particle motion coupling to 
7 itself, electrodynamics coupling to the position-dependent tensor n* = €7'/?H~!/2r in 
the manner given by the field Lagrangian (—17*)!/21*#n*”9F pvl’ag- The nonuniversality 
of this coupling violates EEP and leads to position-dependent effects, for example, in 
gravitational redshift experiments. An alternative way to characterize these effects in the 
case where Local Lorentz Invariance is satisfied is to renormalize the unit of charge and 
the vector potential at each event P according to 


Hy \'/4 Hy \~'/4 
* =e, | —— , AX =A,( —— : 2.70 
a (sr) " ‘ (=e) a) 


Then the action of Eq. (2.65) takes the form 


Inc = Yom f -82)'Pai+ Dre f Agviat 
1 px*2 p*2 “ 
+f [E -B |e. (2.71) 


This action has the special relativistic form, except that the physically measured charge e* 
now depends on location via Eq. (2.70), unless €9(Zp/Ho)'/? is independent of P. In the 
latter case, the units of charge can be effectively chosen so that everywhere in spacetime, 
€9(Ty/Ho)'/? =]; 

Note, however, that if LPI alone is satisfied [Eq. (2.69)], one can renormalize the charge 
and vector potential to make either €9(To/Ho)'/* = 1 or po(To/Ho)'/? = 1, but not 
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both, thus in general LLI need not be satisfied. Thus we see that a necessary and sufficient 
condition for both Local Lorentz Invariance and Local Position Invariance to be valid is 


€ = lo = (Ho/To)'”, for all events P. (2.72) 


These constraints are equivalent to those of metric theories of gravity, Eq. (2.59). 

Return to the terms in /yg, in Eq. (2.62) that depend on the first-order displacements + 
and ¢ from the event P. These occur only in the electromagnetic field part of the action, 
and presumably produce polarizations of the electromagnetic fields of charged bodies 
proportional to the external “acceleration” g, = V®. One would expect these polarizations 
to result in accelerations of composite bodies made up of charged test particles relative to 
the local freely falling frame (i.e., relative to neutral test particles), in other words, to result 
in violations of WEP. These terms are absent if [9 = Ap = Yo = 0, that is, if Eq. (2.69) 
holds. This discussion suggests that WEP alone may guarantee EEP. We now demonstrate 
this directly by carrying out an explicit calculation of the acceleration of a composite test 
body within the THe framework. The resulting restricted proof of Schiff’s conjecture was 
first formulated by Lightman and Lee (1973). 


2.5.2 Proof of Schiff’s conjecture 


We work in the global THe, coordinate system in which Jyg has the form of Eq. (2.56). 
Variation of jg with respect to the particle and field variables gives a complete set of 
particle equations of motion given by 


($».) 1VW=—4 | V (do +A) ~ S] (2.73) 


dt\w “ Moa 


where W = (T — Hv)!/?, Ay = Ao(Xa), A = A(Xq), and “gravitationally modified” 
Maxwell (GMM) equations, given by 


V - (cE) =47p, 
O(cE 
V x (u-'B) = 40d + a (2.74) 
where p = )~,€a0°(x — Xz), J = D>, ea¥ad?(x — Xq). These equations are used to 


calculate the acceleration from rest of an electromagnetically bound composite body 
consisting of charged point particles. A number of approximations are necessary to make 
the computation tractable. First, the functions T, H, € and jz, considered to be functions of 
® are expanded about the instantaneous center-of-mass location X = 0 of the composite 
body, in the form 


T(®) = Tyo + Mey -X + O[(gq +x)’, Veox’,...], (2.75) 


where 7p) = T(x = 0), Ty = dT/d®|,=o. As long as the body is small compared to the scale 
over which ® varies, we can work to first order in g)-x. Second, we assume that the internal 
particle velocities and electromagnetic fields are sufficiently small that we can expand the 
equations of motion and GMM equations in terms of the small quantities v> ~ e”/mor < 1 
where r is a typical interparticle distance. By analogy with the post-Newtonian expansion 
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to be described in Chapter 4, we call this a post-Coulombian expansion; for the purpose of 
the present discussion we will work to first post-Coulombian order. We expect the single- 
particle acceleration to contain terms that are O(go) (bare gravitational acceleration), O(v) 
(Coulomb interparticle acceleration), O(gov7) (post-Coulombian gravitational accelera- 
tion), O(v*) (post-Coulombian interparticle acceleration), O(gqv*) (post-post-Coulombian 
gravitational acceleration), and so on. To O(gov), we obtain 


dvq T) Hy > t,. 
Fan 2H®° y 2H” aT\ Pp Ay Va(¥o * Va) 
1/2 
€a Ty dA 
—|V(A 4: A)- —]|. 2.76 
ia. (Ap + Va - A) a (2.76) 


To write the equation of motion directly in terms of particle coordinates, we must obtain 
the vector potential A,,, in terms of coordinates and velocities to an appropriate order. In a 
gauge in which 
e'Agg —-V-A=0, (2.77) 
the GMM equations take the form 
V7Ao — €utAooo = 4e—!p — €! Ve - (WAp — Ap) 5 

V7A — EVA oo = —4rpid + (ew)! V (eu)(V -A)— Vex (Vx A). (2.78) 
These equations can be solved iteratively by writing A, = A®) + AY, where AY /A® ~ 
O(go) and solving for each term to an appropriate order in v’. The result is 


1 1, _ 
Ay = —@— 7 <oH0X,00 + 50% " (268 x — 8) VX). 


A=V+O(g0), (2.79) 


where 


CqVa 
V=10) ; (2.80) 


We insert the resulting single-particle acceleration into a definition of the acceleration of 
the body’s center of mass. It turns out that to post-Coulombian order, it suffices to use the 
simple center-of-mass definition 


X= mS ° moaXa» m= D> moa. (2.81) 


We then compute, d*X/dt*, substituting the single-particle equation of motion to the 
necessary order, and using the fact that at t = 0, X = dX/dt = 0. The resulting expression 
is simplified by the use of virial identities that relate internal, structure-dependent quantities 
to each other via total time derivatives of other internal quantities. As long as we restrict 
attention to bodies in equilibrium, these time derivatives can be assumed to vanish when 
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averaged over intervals of time that are long compared to internal timescales of the 
body. Errors generated by our definition of the center-of-mass similarly vanish. To post- 
Coulombian order, the required virial identity is 


_ 
‘ieee 5 He eg (2) =0, (2.82) 


where angular brackets denote a time average, and where 


ik — 1 jak na! ee 
T- = 5 Domovives Q = 5 Dieaes 7 3 (2.83) 


a,b ab 


where X45 = Xg —Xp; ab = |Xap|, and the double sum over a and b excludes the case a = b. 
The final result is 


ee =g'-To [=| gi+ To (= | gt, (2.84) 
where c} = To/Ho, 
gi= shel, (2.85) 
and 
ES ri ik ES ie kk 
ae A a ee (2.86) 


are the “binding energy” tensor and scalar of the body. The first term in Eq. (2.84) is g’, the 
universal acceleration of a neutral test body governed by the point particle action alone; the 
other two terms depend on the body’s electromagnetic self-energy and self-energy tensor. 
These terms vanish for all bodies (i.e., WEP is satisfied) if and only if, at any event P, 


=i, —a, (2.87) 


These conditions imply that €0(To/Ho)!/? = C and po (To /Ho)'/? = C_!, where C is 
a constant, but as before, a simple constant rescaling of charges and the vector potential 
using Eq. (2.70) sets C = | which is equivalent to Eq. (2.72), hence 


WEP= LPI and_ LLI, (2.88) 


and Schiff’s conjecture is verified, at least within the confines of the THe formalism. 
It is useful to define the gravitational potential U whose gradient yields the test-body 
acceleration g; modulo a constant, 


U(x) = —(Tp/2Ho)go -x. (2.89) 
If the functions 7, H, € and js are now considered as functions of U instead of ®, then Tp 


and Ao can be expressed in the useful forms 


Io = -35 In E (r/H)'”?] 


x= 


d 
Ao = In [H (T/H)"] (2.90) 
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2.5.3 Energy conservation and anomalous mass tensors 


Because the THe,: formalism is based on an action principle, it possesses conservation 
laws, and so is amenable to analysis using the conserved-energy framework described 
in Section 2.4. The main products of that framework are the anomalous inertial mass 
tensor 5mj* and passive gravitational mass tensor mk, obtained from the conserved 
energy. These two quantities then yield expressions for violations of WEP, Local Lorentz 
Invariance, and Local Position Invariance. 

As a concrete example (Haugan, 1979), we consider a classical bound system of two 
charged particles. As in the “proof” of Schiff’s conjecture we work to post-Coulombian 
order and to first order in gy-x. We first formulate the equations of motion in terms of a trun- 
cated action yg in which the field piece in Eq. (2.56) is dropped and the interaction piece 
is rewritten entirely in terms of particle coordinates by substituting the post-Coulombian 
solutions Eqs. (2.79) and (2.80) for A,,. Variation of Ing with respect to the particle coor- 
dinates then yields the complete particle equations of motion. We identify a Lagrangian L 
using the definition tna = s Ldt. It is convenient to rescale L by the factor T, 7 aes this 
has no effect on the dynamics but will simplify the subsequent analysis. We next make a 
change of variables in Z from x, x2, v; and v2 to the center-of-mass and relative variables 


x= My | (mx +x), V=dX/dt, 
X=xX;—-%x%, v=dx/dt, (2.91) 


where we define My = m, + mo, fs = mm2/Mo. We construct a Hamiltonian H from the 
rescaled L using the standard method: 


P/=0L/OV', p/=0L/Ov, H=P!Vi +p’ -L. (2.92) 
The result, to post-Coulombian order, is 


7 Pp ne? 
H= Moe} (1+ 526y-X) + ie ale 
0 


2To 2M 2 Hoe 7 
F T) A a Te e1e2 Xx 
[se Z| ee ep 
2p: Py +p?P?] Tyo ever [(m- PY + P| 
4 Mice A A 2M; 
+ O(Pp’) + O(p*) + O(P*), 


" iy Pp? 
=Mee, | 1 eX) 
a ( + 77,80 +a 


1/2 
ged | Plt a Ty” ever 
° Ho 2u 2To €0 Ae rE 


Pi pk jie e€1e2 (6 + nin’) 
2Mic* Ceol) 7 


(2.93) 
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where we define 


_ ) pr 
Mp = My +5? (Z zs ek (2.94) 
2u eH r 


The post-Coulombian terms O(p*) and O(P*) neglected in Eq. (2.93) do not couple the 
internal motion and the center-of-mass motion and thus do not lead to violations of EEP. 
We now average H over several timescales for the internal motions of the bound two- 
body system, assumed short compared to the timescale for the center-of-mass motion. 
This allows us to drop terms that are odd in internal vectors, such as terms of O(Pp*), or 
O(Ppe? /r) since such terms vanish on averaging over a system with reflection symmetry. 
The average is further simplified by using virial theorems analogous to Eq. (2.82); these 
are obtained from Hamilton’s equations for the internal variables derived from H, namely 


dink pl 
(? ’ ee (28 nin =e (2.95) 
a Ho€o r 

modulo post-Coulombian corrections. Notice that although the post-Coulombian correc- 
tions in Eq. (2.95) may depend on the center of mass variables P or X, this dependence 


does not affect the form of H: it is only the explicit dependence on P or X in Eq. (2.93) that 
generates the center-of-mass motion. Making use of Eqs. (2.64a) and (2.89) and defining 


1/2 
1 T, ee. «5 
FES = —— 9 (| = pip 2.96 
Bik 2 Hoe ’ i is : ( ) 
we obtain 
: 7 P2 EES ER Ok + EBS, \ PIP 
Ay = Vee — MU 4+ = ov = T i u i, 2.97 
Ma) icy = Mx Mp Oe . Mpc Mr ey) 


where now Mr = My — EES /c2. 
Using Vi = 0(H)/OP/, we can express the momentum P/ and the conserved energy 
Ec = (A) in precisely the form of Eqs. (2.34), with 
ER On + “) 
2 > 


a) 


dmj* = 2To ( (2.98a) 


EES ; 
dmp = 202-6". (2.98b) 
Cc 
0 
Substitution of these formulae into Eq. (2.44) for the general center-of-mass acceleration 
of a composite system yields precisely Eq. (2.84). 


2.5.4 Quantum systems and EEP 


One advantage of the Hamiltonian approach is that it can also be applied to quantum sys- 
tems (Will, 1974). This is especially useful in discussing gravitational redshift experiments 
since it is transitions between quantized energy levels that produce the photons whose 
redshifts are measured. For the idealized gravitational redshift experiments discussed in 
Section 2.4, only the anomalous passive mass tensor 5m} is needed. The simplest quantum 
system of interest is that of a charged particle (electron) moving in a given external 
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electromagnetic potential of a charged particle (proton) at rest in the SSS field, that is, 
a hydrogen atom. For such a system the truncated Lagrangian (2.56) has the form 


L=—m,.(To — Hyv’)!/2 —eA,v", (2.99) 
where mo = m, and e = |e| for the electron. We will ignore the spatial variation of T, H, ¢, 


and yu across the atom, hence we evaluate each at x = 0. The Hamiltonian obtained from L 
is given by 


H=T!? [m2 +H5'|p + eAP]'” + edo, (2.100) 


where p; = OL/Ov!. Introducing the Dirac matrices 


I 0 0 o F 1 0 
par La? a=(? a? AG ik (2.101) 


where J is the two-dimensional unit matrix and o are the constant Pauli spin matrices, we 
perform the “square root” in H, and obtain 


H= Ty? [m8 + Hye: (p + eA)| + edol. (2.102) 


The gravitationally modified Dirac equation is then H|7) = ih(O/0t)|q)). For most appli- 
cations it is more convenient to use the semirelativistic approximation to H obtained by 
means of a Foldy-Wouthuysen transformation, yielding 


1/2 |p + eAl* p* eh 
H=T 1m -B| +eA 
0 In + "DHom, — 8Hm2 | 2m.” ~ | * *° 
c Exp—linv xk) + _v E+ 0(p’) (2.103) 
4Hom i> SHom? 


where we have made the usual identification p —- —iiV and have ignored the effects 
of spatial variations in 7, H, €, or 44 on the atomic structure. For a charged particle with 
magnetic moment M, at rest at the origin, the vector potential as obtained from the GMM 
equations is given (to the necessary accuracy) by 


1 M,~x 
Aas. fag (2.104) 
€0r 2 BP 
The Hamiltonian then takes the form 
H=H, +H, +Hr+He+ Op’), (2.105) 
where 
H, = Ty!’me, 
1/2 
H 1 Dp e 


. Ho 2m €or’ 


pl 4 l he? 
[es ae ( a Jo, 


He 8m2 Heo \4m2r3 
1/2 
iE eh 
Ane = -B 2.10 
it = Fp (=) o-B, (2.106) 


where L = x xX p is the angular momentum of the electron. The four pieces of H are 
the usual rest mass (H;), principal (H,), fine-structure (Hs) and hyperfine-structure (Hp) 
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contributions. We have ignored the Darwin term («x V - E). The magnetic field produced 
by the proton is given by 


1 E —3n(n-M,) 


B=VxA=—519 : MB «| (2.107) 


r 
We must first identify the proton magnetic moment. From the hyperfine term Hp,, it is clear 
that the magnetic moment of the electron is given by 
T A > eh 
ae “Hy 2me : 
It is then reasonable to assume that the magnetic moment of the proton has the same 
dependence on 7 and Hp, so that 


(2.108) 


Ty) gpeh 
‘ a Op, (2.109) 
where g, is the gyromagnetic ratio of the proton and my, is its mass. Then 
1 polo ( 2pe7h? o,—3n(n-o,) 8r 
Ane = e° }6 : 2.110 
" 2 (Ge rP ae (*) ( ) 
Solving for the eigenstates of the Hamiltonian using perturbation theory yields 
H Hy \’ Hi 
E=7)? me+( *) & ! ( ”) Es4 ( ° +) &i ; (2.111) 
To€ To To 0 


where €), €s and Epp are the usual expressions for the principal, fine-structure, and 
hyperfine-structure energy levels in terms of atomic constants 7, e, Mp, Zp, h, and quantum 
numbers. In order to calculate the anomalous mass tensor 6m? as we must determine 
the manner in which E varies as the location of the atom is changed. Expanding E to 
first order in gy: X, substituting Eq. (2.89), converting to the conserved energy function 
Ec= Ce /Ho), defining the binding energy £9, = EES + EE + EEF, where 


ER =-6°&, ER=—HoTy'eo*E, Ep’ =—boe *Enr, (2.112) 
we obtain Eq. (2.34) (with V = 0), with 


EE ES 
bm," = OT) —* 6F, 
“ 


om Oe — =4T Fa sik 
C 
HF 


E 
5mUF VE — (3.Aq — Tp) —8-5*. (2.113) 
cS 
Compare 5m Sk here with Eq. (2.98b). 
A useful fact that emerges from the solution for the energy eigenstates is that the Bohr 
radius is given by 
pi 


€0 he 
a= a (= =) (2.114) 


This is important in analyzing the gravitational redshift of microwave or optical cav- 
ities. From the quantization of the electromagnetic field in the 7Heu formalism, it is 
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straightforward to see that the energy of the electromagnetic field in a cavity is proportional 
to fiw, where w is the angular frequency of the wave mode in the cavity, and where 
the dispersion relation implied by the GMM equations implies that |k| = (€gpi9)!/2w. 
The wavenumber & must satisfy A- L = nz for a stationary mode, where |L| is the length 
of the cavity. But L is proportional to an integer number of atoms times the interatomic 
spacing, whose fundamental scale is the Bohr radius. But Eq. (2.114) shows that this scales 
as Th!” €9/Ho. Thus |A| « Ht. and the energy of the mode can be written as 


A 


E= _cavity > 
1/2 3/2 1/2 
qT 4 eq by 


(2.115) 


where Egayity depends only on atomic constants and integers. Expanding in terms of gy - x 
and calculating the conserved energy function Ec, we obtain Eq. (2.34), with 


cavity =L/2 =3/2 

E, = —Ceavity Ug ne / ’ 

5 (cavity )jk 1 neg ik 

m\ = 5 (3Fo + Ao) 2-8", (2.116) 
0 


2.5.5 Local Lorentz Invariance and the 2 formalism 


If we wish to focus attention on tests of local Lorentz Invariance then the THe: formalism 
can be simplified (Haugan and Will, 1987; Gabriel and Haugan, 1990). Since most such 
tests do not concern themselves with the spatial variation of the functions 7, H, ¢, and pu, 
but rather with observations made in moving frames, we can treat them as spatial constants, 
so that 7, = Hj) = & = by = 0. Then by rescaling the time and space coordinates using 
Eqs. (2.60), and the charges and the electromagnetic fields using Eqs. (2.70), we can put 
the action (2.56) into the form 


Ing =— Ym f (1 - v2) at + Se f Anvtae 


1 


8ir 


{EB — °B’} &s, (2.117) 


where c? = Ho/(To€op0) = (1 — Yo) ~!. This amounts to using units in which the limiting 
speed co of massive test particles is unity, and the speed of light is c. If c # 1, LLI is 
violated, and the form of the action above must be assumed to be valid only in some 
preferred universal rest frame. The natural candidate for such a frame is the rest frame of 
the microwave background. 

The electrodynamic equations that follow from Eq. (2.117) yield the behavior of rods 
and clocks, just as in the full THe formalism. For example, the length of a rod that moves 
with velocity V relative to the rest frame in a direction parallel to its length will be observed 
by a rest observer to be contracted relative to an identical rod perpendicular to the motion 
by a factor 1 — V7/2 + O(V*). Notice that c does not appear in this expression, because 
only electrostatic interactions are involved in determining the interatomic spacing in the 
rod, while c appears only in the magnetic sector of the action. 
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The energy and momentum of an electromagnetically bound body moving with velocity 
V relative to the rest frame are given by 


P! = MV! + bmi, 
1 Fo 
Eo = Mace’ + MV 4 som VV", (2.118) 


where Mg = My — EES, 
Sm)" = 2To (EBS Sn + ESS) 5 (2.119) 


and where EB is given by Eq. (2.96), with EES = OER. 
Note that To corresponds to the parameter do in Eq. (2.17) and plotted in Figure 2.3. 


2.5.6 Application to tests of EEP 


We now apply the THe, framework to some of the specific experimental tests of EEP 
described in Section 2.3. Most such experiments occur in Earth-bound or near-Earth 
orbiting laboratories, and they tend to single out the gravitational field of one body, the 
Earth or the Sun as the relevant field driving the phenomenon being studied. These fields 
are spherically symmetric to a high degree of accuracy, and because the timescales over 
which these field change are long compared to the internal timescales of the atoms that 
make up the apparatus being used, they can be treated as static. 


Tests of WEP 


Equation (2.84) gives the acceleration of a composite body through post-Coulombian 
order in an external SSS field. When specialized to composite bodies that are spherical 
on average (a good approximation for experimental situations), the resulting acceleration 


is given by 
VX He 8 
— =231+-4 (20-57 2.12 
a «| + a ( oe od. (2.120) 
where ie is given by Eq. (2.86). Then the Eétvés ratio 7° defined in Eq. (2.6) is given by 
PS = 209 — sYo, (2.121) 


permitting us to translate experimental bounds summarized in Figure 2.2 into bounds on 
parameters of the THe framework. 

Haugan and Will (1977) extended the calculation of the acceleration to “post-post- 
Coulombian” order, finding violations of WEP dependent on “magnetostatic” energies pro- 
portional to (e,¢5/Tap)VaVy, with amplitudes dependent upon Ag as well as on Tp and Yo. 


Tests of LLI 


Here it is natural to use the c* formalism, and to let the velocity V = w, the velocity of the 
laboratory relative to the cosmic background radiation. 
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From the quantized electrodynamic sector of the action (2.117) the energy of a photon 
is iw, while its momentum is fiw/c. Using this approach, and working in the presumed 
universal preferred frame, one finds that the difference in round trip travel times of light 
along the two arms of the interferometer in the Michelson-Morley experiment is given 
by YoLo(w?/c), where Lo is the length of the arms Unlike the standard special relativistic 
case, the Lorentz-Fitzgerald contraction of the parallel arm («x 1—w?/2) does not precisely 
cancel the time difference along the two paths because the latter depends on c, while the 
former does not. 

The behavior of moving atomic clocks can be analyzed in detail, and bounds on Yo can 
be placed using results from tests of time dilation and of the propagation of light. In some 
cases, it is advantageous to combine the c” framework with a “kinematical” viewpoint that 
treats a general class of boost transformations between moving frames. Such kinematical 
approaches have been discussed by Mansouri and Sexl (1977a,b,c), MacArthur (1986) and 
Will (1992a). 

For example, in the “JPL” experiment (see Figure 2.3), in which the phases of two 
hydrogen masers connected by a fiberoptic link were compared as a function of the Earth’s 
orientation, the predicted phase difference as a function of direction is, to first order in w, 
the velocity of the Earth through the cosmic background, 

OD iin tn aa (2.122) 

@ 3 

where @ = 27vL, vis the maser frequency, L = 21 kmis the baseline, and # and mg are unit 
vectors along the direction of propagation of the light at a given time and at the initial time 
of the experiment, respectively. The observed limit on a diurnal variation in the relative 
phase resulted in the bound |d9| = |Yo| < 3 x 1074. Tighter bounds were obtained from 
a “two-photon absorption” (TPA) experiment, and a 1960s series of “Méssbauer-rotor” 
experiments, which tested the isotropy of time dilation between a gamma ray emitter on 
the rim of a rotating disk and an absorber placed at the center (Will, 1992a). 

Similarly the anisotropy in energy levels is clearly illustrated by the tensorial terms in 
Egs. (2.118). To illustrate this in a concrete setting, we will consider the Hughes-Drever 
experiment (Haugan, 1978). The anisotropic term in Eq. (2.118) could lead to energy 
shifts of states having different values of dm) and thus to observable effects in a quantum 
mechanical transition between these states. In the case of the Hughes-Drever experiment, 
the system, a ’Li nucleus, can be approximated as a two-body system consisting of J = 0 
core (two protons and four neutrons) of charge +2, and a valence proton in a ground state 
with angular momentum of |. The spin of the proton couples to its angular momentum to 
yield a total angular momentum J = 3/2. In an applied magnetic field, the four magnetic 
substates my = +1/2, +3/2 are normally split equally in energy, giving a singlet emission 
line for transitions between the three pairs of states. How does the anistropic term alter the 
energies of these four states? The isotropic part of dm/" oc EES6/* in Eq. (2.119) simply 
shifts all four levels equally, since (JMj\e1e2r~ '||.JM,) is independent of M;. However, the 
other contribution to dm} od Eee , does shift the levels unequally. We first decompose 
w/ into a component wy , parallel to the applied magnetic field and a component w_, 
perpendicular to it. Then 
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ik 

wih = : (Wj cos’6 + 2w wy cos @ sin @ sin @ + wie sin? sin’) ; (2.123) 
where @ and ¢ are polar coordinates appropriate to the orbital wave function ¢,, = 
I(r) Yim(6, @) in a basis where the magnetic field is in the z-direction. By combining the 
orbital wave function and spin states into eigenstates of total J, M;, we then calculate 
the expectation value of (1/ 2)dmi*wiwk in states of different M). Taking the difference 
in the energy shifts between adjacent M,, states, we find that the singlet line splits into a 


triplet with relative energies 


hv (3/2 > 1/2) =6, 
hv(1/2 + —1/2) =0, 


h=1f2 5 3p) =, (2.124) 
where 
2 ERS 2 2 
§= 51S (wi. - wi) (2.125) 


(recall that in the units of the c? formalism, c is a dimensionless number, presumably close 
to or equal to unity). In the notation of Eq. (2.14) we can write that 


2 
5 = To (w. — wi) (2.126) 
By assuming that w = 368 km s_', wecan infer the bound on Y or 69 shown in F igure 2.3, 
for the Hughes-Drever experiments. For the later improved experiments using cold trapped 
atoms, analogous calculations must be done for each specific atomic species used in order 
infer the bounds shown. 


Tests of LPI 


Consider gravitational redshift experiments. Suppose, for example, that one measures 
the gravitational redshift of photons emitted from various transitions of hydrogen, such 
as principal transitions, fine-structure transitions within a principal level, a hyperfine 
transition such as the 21 cm line, the basis for hydrogen maser clocks, or of a photon from 
an electromagnetic cavity. Then, substituting Eqs. (2.113) and (2.116) into Eq. (2.47), we 
obtain (Will, 1974) 


Z, = [1 — 20] AU/cG, 
Zp = (1 — 410] AU/cG, 
Zi = (1 — (320 — Ao)| AU/co, 
1 
Zeavity = [ 5 (30 + An)| AU/e. (2.127) 


Notice that the four redshifts are different in general. Thus the gravitational redshift 
depends on the nature of the clock whose frequency shift is being measured unless 9 = 
Ao = 0, that is, unless LPI is satisfied. From the hydrogen-maser Gravity Probe A experi- 
ment or the SAO-Stanford microwave cavity/H-maser comparison shown in Figure 2.4, it 
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is straightforward to infer bounds on I’p and Apo. In order to infer bounds on these param- 
eters from the advanced null redshift experiments shown in Figure 2.4 that involve clocks 
based on Rubidium, Cesium, Dysprosium, or Aluminium, one must carry out more sophis- 
ticated atomic physics calculations in order to establish the analogues of Eqs. (2.127). 


2.6 The Standard Model Extension 


Despite their utility, the THej and c* formalisms are restricted to electromagnetic 
interactions. What about the other nongravitational interactions—the strong and the weak 
interactions? 

Kostelecky and collaborators developed an important framework for discussing viola- 
tions of Local Lorentz Invariance in the context of the full Standard Model of particle 
physics (Colladay and Kostelecky, 1997, 1998; Kostelecky and Mewes, 2002). Called the 
Standard Model Extension (SME), it takes the standard SU(3) x SU(2) x U(1) field theory 
of particle physics, and modifies the terms in the action by inserting a variety of tensorial 
quantities in the quark, lepton, Higgs, and gauge boson sectors that could explicitly violate 
LLL SME extends the THe and c* frameworks, and a related “x — g” framework of Ni 
(1977) to quantum field theory and particle physics. The modified terms split naturally into 
those that are odd under CPT (i.e., that violate CPT) and terms that are even under CPT. 
The result is a rich and complex framework, with hundreds of parameters to be analyzed 
and tested by experiment. Such details are beyond the scope of this chapter; for a review 
of SME and other frameworks, the reader is referred to the Living Review by Mattingly 
(2005) or the review by Liberati (2013). The review of the SME by Kostelecky and Russell 
(2011) provides “data tables” showing experimental bounds on all the many parameters of 
the SME. 

Here we confine our attention to the QED sector, in order to illustrate some features of 
the SME and to link it with the c? framework discussed in Section 2.5.5. In a simplified 
version of the SME that preserves CPT, the action for an electron interacting with an 
electromagnetic field takes the form J = [ Ld'x, with 


7 ys Vv 1 (oI va 
L = hp + Pi ln + eu] Dd — 5 [nhone? +e? | Fy Fas, (2.128) 


where D! = 0" + ieA", y# are gamma matrices, c,,,, is a real tracefree tensor, and ky aaa 
is a tensor with the symmetries of the Riemann tensor, and with vanishing double trace. In 
this simple model there are 15 + 19 = 34 free parameters in principle. 

The tensor jer can be decomposed into “electric,” “magnetic,” 
components, by defining 


zy 


and “odd-parity” 


jk Oj0k 
KDE = —2k¢ 5 

1, : 
Ls — =. jpq krs ,pqrs 
Kop = 5¢ € i ‘ 


aa ae (2.129) 


2.6 The Standard Model Extension 


In many applications it is useful to use the further decomposition 


l . 
Retr = 3 be 
RK = ve + Kyp)/ 
e+ = 5 (Spe + Kup)" 
i 1 
Al = 5 (KDE Kup)” oF Kb, 
wie — 1 i 
iy = 3 (koe + Kure)", 
ik 1 . 
Rj. = (moa — wns) (2.130) 


The first expression is a single number, the next three are symmetric tracefree matrices, 
and the final is an antisymmetric matrix, accounting thereby for the 19 components of the 
original tensor ky sae 

In the rest frame of the universe, these tensors have some form that is established by the 
global nature of the solutions of the overarching theory being used, and could in principle 
be anisotropic. In a frame that is moving relative to the universe, the tensors will have 
components that depend on the velocity of the frame, and on the orientation of the frame 
relative to that velocity, thus leading to violations of LLI. 

However, since the observable universe is isotropic to a high degree of accuracy (parts 
in 10°), then it is not unreasonable to assume that, in whatever theory is responsible for the 
Lorentz violation, in the rest frame of the cosmic microwave background, the tensors c,,,, 
and kf v8 should be isotropic, so that Cie Sik i x of i = 0, and ee ox Hlk5y, 
Combining this with the tracefree constraint, it is simple to show that the tensors can be 
expressed in the general form, valid in any frame moving with four-velocity uv" relative to 
the preferred frame: 


1 
Cuv = Cc Caz + i) > 
River — (4ultyllow = nfl”) , (2.131) 


Working in the preferred frame, we write out the Lagrangian explicitly in terms of temporal 
and spatial derivatives and in terms of electric and magnetic fields, using E; = Fy, 
B; = 36xuFu. We then rescale the coordinates by t = ai, x/ = (x, which induces 
the transformations D° = aD?, Di = BDI, E = (aB8)—'E and B = £~2B. Finally, 
we rescale the electric charge and the vector potential by e = e*(1 + ftr)!/?/(a8), and 
Ag = A%a8/(1 + Rer)!/?. Choosing a = (1 — 3/4)~! and 6 = (1 + ¢/4)~', we put the 
Lagrangian (2.128) into the form (after dropping hats and asterisks), 


: - 1 
L= dm + dinuy!D’y — 5 (E? — c°B’) , (2.132) 


which is the Lagrangian in the c” framework, with 


~ \1/2 , 
1 — Ker 1+¢/4 

= 2.133 

. (=) (77) cee 
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To first order in the small parameters &,, and c, we then have 


4 
To = —2(Ftr — 2) = 2 (Fe seo) (2.134) 


where coo is the 00 component of c,,, in the preferred frame. The latter expression 
corresponds to the parameter combination listed in table VII of Kostelecky and Russell 
(2011). 


2.7 EEP, Particle Physics, and the Search for New Interactions 
—— SSS SS SSS SSS SS SS SS ay 


Thus far, we have discussed EEP as a principle that strictly divides the world into metric 
and nonmetric theories, and have implied that a failure of EEP might invalidate metric 
theories (and thus general relativity). On the other hand, there is theoretical evidence to 
suggest that EEP could be violated at some level, whether by quantum gravity effects, by 
effects arising from string theory, or by hitherto undetected interactions. Roughly speaking, 
in addition to the pure Einsteinian gravitational interaction, which respects EEP, theories 
such as string theory predict other interactions which do not. In string theory, for example, 
such EEP-violating fields are known to exist. However, the theory is unable to provide 
a robust or unique prediction of their strength relative to gravity, or a determination 
of whether they are long range, like gravity, or short range, like the nuclear and weak 
interactions, and thus too short-range to be detectable, although numerous toy-model 
calculations exist. For examples of specific models, see Taylor and Veneziano (1988) and 
Damour and Polyakov (1994) Another class of nonmetric theories is the “varying speed of 
light (VSL)” set of theories; for a detailed review, see Magueijo (2003). 

On the other hand, whether one views such effects as a violation of EEP or as effects 
arising from additional “matter” fields whose interactions, like those of the electromagnetic 
field, do not fully embody EEP, is to some degree a matter of semantics. Unlike the fields 
of the standard model of electromagnetic, weak and strong interactions, which couple to 
properties other than mass-energy and are either short range or are strongly screened, 
the fields inspired by string theory or other theories could be long range (if they remain 
massless by virtue of a symmetry, or at best, acquire a very small mass), and can couple to 
mass-energy, and thus can mimic gravitational fields. Still, there appears to be no way to 
make this precise. 

As a result, EEP and related tests are now frequently viewed as ways to discover or 
place constraints on new physical interactions, or as a branch of “nonaccelerator particle 
physics,” searching for the possible imprints of high-energy particle effects in the low- 
energy realm of gravity. Whether current or proposed experiments can actually probe these 
phenomena meaningfully is an open question at the moment, largely because of a dearth 
of firm theoretical predictions. For example, notwithstanding the impressive experimental 
bounds on the many parameters of the SME, no reasonable theory is known to have been 
killed by those constraints. 
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The “fifth” force 


On the phenomenological side, the idea of using EEP tests in this way originated in the 
middle 1980s, with the search for a “fifth” force. In 1986, as a result of a detailed reanalysis 
of E6tvés’ original data, Fischbach et al. (1986) suggested the existence of a fifth force of 
nature, with a strength of about a percent that of gravity, but with a range (as defined by 
the range \ of a Yukawa potential, « r~!e~"/>) of a few hundred meters. This proposal 
dovetailed with earlier hints of a deviation from the inverse-square law of Newtonian 
gravitation derived from measurements of the gravity profile down deep mines in Australia, 
and with emerging ideas from particle physics, notably kaon decay, suggesting the possible 
presence of very low-mass particles with gravitational-strength couplings. 

During the next four years numerous experiments looked for evidence of the fifth 
force by searching for composition-dependent differences in acceleration, with variants 
of the Edtvés experiment or with free-fall Galileo-type experiments. Although two early 
experiments reported positive evidence, the others all yielded null results. Over the range 
between one and 10* meters, the null experiments produced upper limits on the strength 
of a postulated fifth force between 10-7 and 10~° of the strength of gravity. Interpreted 
as tests of WEP (corresponding to the limit of infinite-range forces), the results of two 
representative experiments from this period, the free-fall Galileo experiment and the early 
Eot-Wash experiment, are shown in Figure 2.2. The associated gray region in that figure 
is meant to encapsulate the many experimental bounds reported during that period. At 
the same time, tests of the inverse-square law of gravity were carried out by comparing 
variations in gravity measurements up tall towers or down mines or boreholes with 
gravity variations predicted using the inverse square law together with Earth models and 
surface gravity data mathematically “continued” up the tower or down the hole. Despite 
early reports of anomalies, independent tower, borehole, and seawater measurements 
ultimately showed no evidence of a deviation. Analyses of orbital data from planetary 
range measurements, lunar laser ranging (LLR), and laser tracking of the LAGEOS 
satellite verified the inverse-square law to parts in 10° over scales of 10°—10° km, and 
to parts in 10° over planetary scales of several astronomical units Talmadge et al. (1988). 
A consensus emerged that there was no credible experimental evidence for a fifth force of 
nature, of a type and range proposed by Fischbach et al. For reviews and bibliographies of 
this episode, see Fischbach et al. (1992), Fischbach and Talmadge (1992), Fischbach and 
Talmadge (1999), Adelberger et al. (1991), and Will (1990). 


Short-range modifications of Newtonian gravity 


Although the idea of an intermediate-range violation of Newton’s gravitational law was 
dropped, new ideas emerged to suggest the possibility that the inverse-square law could be 
violated at very short ranges, below the centimeter range of existing laboratory verifications 
of the 1/ r” behavior. One set of ideas (Antoniadis et al., 1998; Arkani-Hamed et al., 1998; 
Randall and Sundrum, 1999a, 1999b) posited that some of the extra spatial dimensions that 
come with string theory could extend over macroscopic scales, rather than being rolled up 
at the Planck scale of 10~*? cm, which was then the conventional viewpoint. On laboratory 
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distances large compared to the relevant scale of the extra dimension, gravity would fall 
off as the inverse square, whereas on short scales, gravity would fall off as 1/7°+”, where 
n is the number of large extra dimensions. Many models favored n = 1 orn = 2. Other 
possibilities for effective modifications of gravity at short range involved the exchange of 
light scalar particles. 

Following these proposals, many of the high-precision, low-noise methods that were 
developed for tests of WEP were adapted to carry out laboratory tests of the inverse 
square law of Newtonian gravitation at millimeter scales and below. The challenge of 
these experiments was to distinguish gravitation-like interactions from electromagnetic 
and quantum mechanical (Casimir) effects. No deviations from the inverse square law 
have been found to date at distances between tens of nanometers and 10 mm (Long et al., 
1999; Hoyle et al., 2001, 2004; Chiaverini et al., 2003; Long et al., 2003; Kapner et al., 
2007; Adelberger et al., 2007; Tu et al., 2007; Geraci et al., 2008; Sushkov et al., 2011; 
Bezerra et al., 2011; Yang et al., 2012; Klimchitskaya et al., 2013). For a comprehensive 
review of both the theory and the experiments circa 2002, see Adelberger et al. (2003). 


The Pioneer anomaly 


In 1998, Anderson et al. reported the presence of an anomalous deceleration in the motion 
of the Pioneer 10 and 11 spacecraft at distances between 20 and 70 astronomical units 
from the Sun. Although the anomaly was the result of a rigorous analysis of Doppler data 
taken over many years, it might have been dismissed as having no real significance for new 
physics, where it not for the fact that the acceleration, of order 10~? m/ s°, when divided 
by the speed of light, was strangely close to the inverse of the Hubble time. The Pioneer 
anomaly prompted an outpouring of hundreds of papers, most attempting to explain it via 
modifications of gravity or via physical interactions beyond the standard model of particle 
physics, with a small subset trying to explain it by conventional means. 

Soon after the publication of the initial Pioneer anomaly paper, Katz (1999) pointed 
out that the anomaly could be accounted for as the result of the anisotropic emission 
of radiation from the radioactive thermal generators (RTG) that continued to power the 
spacecraft decades after their launch. At the time, there was insufficient data on the 
performance of the RTG over time or on the thermal characteristics of the spacecraft to 
justify more than an order-of-magnitude estimate. However, the recovery of an extended 
set of Doppler data covering a longer stretch of the orbits of both spacecraft, together 
with the fortuitous discovery of project documentation and of telemetry data giving on- 
board temperature information, made it possible both to improve the orbit analysis and to 
develop detailed thermal models of the spacecraft in order to quantify the effect of thermal 
emission anisotropies. Several independent analyses now confirm that the anomaly is 
almost entirely due to the recoil of the spacecraft from the anisotropic emission of residual 
thermal radiation (Rievers and Lammerzahl, 2011; Turyshev et al., 2012; Modenini and 
Tortora, 2014) For a thorough review of the Pioneer anomaly published just as the new 
analyses were underway, see the Living Review by Turyshev and Toth (2010). 


Gravitation as a Geometric Phenomenon 


The overwhelming empirical evidence for the Einstein Equivalence Principle that we 
presented in Chapter 2 supports the conclusion that the only theories of gravity that have 
a hope of being viable are metric theories, or possibly theories that are metric apart from 
very weak or short-range non-metric couplings (as in string theory). Therefore, for the 
remainder of this book, we will turn our attention exclusively to metric theories of gravity. 
In Section 3.1, we review the concept of universal coupling, first defined in Section 2.2 
and restate the postulates of metric theories of gravity. We then develop, in Section 3.2, 
the mathematical equations that describe the behavior of matter and nongravitational 
fields in curved spacetime. Every metric theory of gravity possesses these equations. 
Metric theories of gravity differ from each other in the number and type of additional 
gravitational fields they introduce and in the field equations that determine their structure 
and evolution; nevertheless, the only field that couples directly to matter is the metric itself. 
In Section 3.3, we discuss general features of metric theories of gravity, and present an 
additional principle, the Strong Equivalence Principle that is useful for classifying theories 
and for analyzing experiments. 


3.1 Universal Coupling 
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The validity of the Einstein Equivalence Principle requires that every nongravitational field 
or particle should couple to the same symmetric, second rank tensor field of signature —2. 
In Section 2.2, we denoted this field g, and saw that it was the central element in the 
postulates of metric theories of gravity: (1) there exists a metric g, (i1) test bodies follow 
geodesics of g, and (111) in local Lorentz frames, the nongravitational laws of physics are 
those of special relativity. 

The property that all nongravitational fields should couple in the same manner to a single 
gravitational field — universal coupling — allows one discuss the metric g as a property of 
spacetime itself rather than as a field over spacetime. This is because its properties may 
be measured and studied using a variety of different experimental devices, composed of 
different nongravitational fields and particles, and because of universal coupling, the results 
will be independent of the device. Consider, as a simple example, the proper time between 
two events as measured by two different clocks. To be specific, imagine a Hydrogen 
maser clock and an electromagnetic cavity clock at rest in a static spherically symmetric 
gravitational field. If each clock is governed by a Hamiltonian H, then the proper time 
(number of clock “ticks”) between two events separated by coordinate time df is given by 
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N=vdt=h"'Edt, (3.1) 


where F is the eigenstate energy of the Hamiltonian (or energy difference, for a transition). 
The results of Section 2.5.4 show that if, for instance, the THe, formalism is applicable, 
and if EEP is satisfied, then €) = fo = (Ho/To)!/? everywhere, thus using Eqs. (2.111) 
and (2.115) we obtain for each clock 


Nu x dt(HoTy ” Woe9?) = Ty! at, 
Newity % dt(HoT> 1/2 jig? Pare rat, (3.2) 


where the proportionality constants are fixed by calibrating each clock against a standard 
clock far from gravitating matter. Thus, each clock measures the same quantity 7p (in 
metric theories of gravity, in SSS fields, 7) = —goo) and the proper time between two 
events is a characteristic of spacetime and of the location of the events, not of the clocks 
used to measure them. 

Consequently, if EEP is valid, the nongravitational laws of physics may be formulated by 
taking their special relativistic forms in terms of the Minkowski metric 77 and simply “going 
over” to new forms in terms of the curved spacetime metric g, using the mathematics of 
differential geometry. The details of this “going over’ are the subject of the next section. 


3.2 Nongravitational Physics in Curved Spacetime 


3.2.1 Charged point-particle dynamics 
In local Lorentz frames, the nongravitational laws of physics are those of special relativity. 


For charged point test particles coupled to electromagnetic fields, for example, these laws 
may be derived from the action 


dxtt dx® 
b= Tm f (nel) “ane fast 


a Pail? FaoF a 3(—H) die, (3.3) 
where 
Fao = Aop —Apgp, 
fy = det(npao). (3.4) 
Here, 77 is the Minkowski metric, which in Cartesian coordinates has the form 
Nao = diag(—1, 1, 1, 1). (3.5) 


In the local Lorentz frame, 7,7 is assumed to have this form only up to corrections of order 
[x* — x°(P)]*, where x°(P) is the coordinate of a chosen fiducial event P in the local 
frame, in other words, 7a is described more precisely as 
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(3.6) 


According to the discussion in Section 3.1, the general form of these laws in any frame is 
obtained by a simple coordinate transformation from the freely falling frame to the chosen 
frame. This transformation is given by 


xP = xP (x) | (3.7) 


, fi 
Fr le ee (3.8) 
Ox® 
D% = Hx*,x? \d*x, (3.9) 
Ox® OxP 
Nao ~~ Oxi Ox? Nap ’ (3.10) 
Ox® 
Ap = 5G Aa: (3.11) 


where J is the Jacobian of the transformation. Partial derivatives of fields, as for example 
in the formula for Fz», transform according to 


Ox? A (Ax® Ox® Ax? 2% 
ee, (3.12) 
Ox? Ox® \ Axh Ox Ox? Oxhix? 
However, in the local frame, 7,9, = 0. Thus, 
0 Ox Ax? Ax? x OPx® AxF . Ox? Ax GB.13 
= hi, = Aa ~ Dy lo aa lo Dw ~ lap - : 
Ie, Ox! Ox” Ox” NoBey OxPX® Ox? ue Ox? x” Axl oe ) 
Using the fact that 
Ox” Ax! 
—~—— = 6 3.14 
Ox” Ox® we ele) 
we obtain 
7 Ox! Ax® Gx ; Ox” Ax® Ax (3.15 
a aa Ox® OxV OxPx# i Ox? Ox Ox? x” we Amy 


If we now define 77 in the new coordinates to be the metric for a nonlocal Lorentz frame, 


Sa8 = Nas» (3.16) 
Ox On? 2s 

ab =) po I 

gs" = |lgasll” =aa ae > (3.17) 
Ox! Ax” Ox 

T= = al 

By OxP Ax Oxhix? ont) 
then Eq. (3.15) can be written 

Sop =To,g58 +} yBas- (3.19) 


Permuting the a, ( and y indices to generate two additional versions of this equation, 
and making use of the symmetry of 13, on the lower indices, which follows from 


64 


Gravitation as a Geometric Phenomenon 


Eq. (3.18), and combining the three equations suitably, we obtain the standard formula 
for the Christoffel symbols ay (also known as connection coefficients) 


a 1 OL 
Ty = 78° (8667 + 861.8 — 88.8) - (3.20) 


Then Eq. (3.12) becomes 


Ox™ OxP 2. 
Apo —_ Ox Axe (4a ~_ i Ay) . (3.21) 


We define the covariant derivative “;” by 


Ao: = Aaa — 11547, (3.22) 


and notice that it transforms as a tensor; it can be shown that 


A*.p =A g +T5eA, (3.23) 
where A® = g®° Ag. Taking the determinant of Eq. (3.10) yields 
n = [det(Ox*/dx")]’g, (3.24) 
where g = det(g,,,). Recalling that det(Ox%/Ox") = J(x",x%)—!, we obtain 
ie 
fi yo) — 
I(x! x%) = Cala” (3.25) 


Substituting these results into /yg gives 


2 


dxtt dx¥ Me 
Inc = Ym. | (eS) d+ Sen f Autor 


_ lon ghegh® F Fop(—g)'/?d'x, (3.26) 


where 


| eae Renin, ern (3.27) 


We notice that the transformation from a local inertial frame to an arbitrary frame has 
resulted simply in the replacements 


Nuv by Sy 
“comma” by “semicolon” (3.28) 
(—n)'?d4x by (—g)!/?d4x. 


This is the mathematical manifestation of EEP. We must point out that the specific 
mathematical forms given for the Christoffel symbols, transformation laws, and so on are 
valid only in coordinate bases (see MTW, chapter 10, for further discussion). However, in 
this book we will work exclusively in coordinate bases. 

Generally speaking, then, the procedure for implementing EEP is: put the local special 
relativistic laws into a frame-invariant form using Lorentz-invariant scalars, vectors, 
tensors, and so on, then make the above replacements. It is simple to show that the same 
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rules apply to the field equations and equations of motion derived from the Lagrangian. In 
the local frame they are 


d(ur)a , 
mo, tie = eF io (u" Ja 5 
FRY, = An JP, (3.29) 


where dr = (—1navdx"dx”)!/?, ul" = dx /dr, and the current is given by 


dxf 
Pa) ee z J. (3.30) 


However, these are not in a manifestly frame-invariant form. We must instead write them 
in the equivalent forms dug /dr = u’ up, and 


. fi 
ye = De fw —1/254(y 4 By ar, (3.31) 
ae 


where we have used the fact that, for the four-dimensional delta function, (—7#)~!/75* is 
invariant (since [ 6+d*x = 1 or 0 regardless of the frame). Then in the general frame the 
equations are 


D(u a Vy 
Moa ie = CaF wp (u Ja > 
Fe = And, (3.32) 


where dr = (—g,,,dx"dx”)!/?, ul = dx" /dr, 


Diu a Vy 
fs SU Uys (3.33) 
and the current is given by 
Z dx! 
Jt =)" ea(—g) 2-9°(x— xa) a (3.34) 


a 


However, there is a potential ambiguity in the application of EEP to electrodynamics 
if one writes Maxwell’s equations (3.32), in terms of the vector potential 4,,. In the local 
Lorentz frame, Maxwell’s equations have the special relativistic form 


AP — Ale = Ans (3.35) 


It is always possible to choose a gauge (Lorenz gauge) in which A‘, = 0, thus, since 
A®li , = APS", we have 


aA® = AM 5 = An, (3.36) 


It is tempting then to apply the rules of EEP to this equation to obtain 


AH = AMY = pYA 4H = Ar, 
t=O. (3.37) 
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However there is another alternative. The curved-spacetime Maxwell equation, Eq. (3.32), 
yields 


AMY. — AXH, = —4 J (3.38) 


But covariant derivatives of vectors and tensors do not commute in curved spacetime, in 
fact in general 


A* gq, = A¥ ag + Rept’ » (3.39) 
where Ri". g 1s the Riemann curvature tensor, given by 
Bo pe pe Y _ pep 
Reap = Tipe ~ oe + | More Laer ee (3.40) 
Then 
AH, = AY 3H + REAP, (3.41) 


where R’ is the Ricci tensor, given by 


R =e" Rigs Rug = Poa: (3.42) 
This version of Maxwell’s equations in Lorenz gauge becomes 
eth — REAP = 4a, 
AY’, =0. (3.43) 


It is generally agreed that this second version is correct (although there is no experimental 
evidence one way or the other). To resolve such ambiguities, the following rule of thumb 
should be applied: the simple replacements 7 — g and comma — semicolon should be 
used in equations formulated using physically measurable quantities in the local Lorentz 
frame (F“” is physically measurable while A” is not). For a fuller discussion, see MTW, 
box 16.1. 


3.2.2 Geodesics 


An uncharged test body follows a trajectory given by Eq. (3.32) with e, = 0, namely, 
Dut /Dr = 0. This equation can easily be written in the form 


axt! hie dx® dx? 


= 3.44 
dr? a8 dr dr ( ) 
This is the geodesic equation. It is also the curve that extremizes the invariant action 
dx! dx” \'/? 
I ——— ws dt =— d 3.45 
NG i («. di di ) / T ( ) 


between two fixed events in spacetime. 

The mathematics of measurements made by atomic clocks and rigid measuring rods 
follow the same rules since the structure of such measuring devices is governed by 
solutions of the nongravitational laws of physics. In special relativity, the proper time 
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between two events separated by an infinitesimal coordinate displacement dx% as measured 
by any atomic clock moving on a trajectory that connects the events, is given by 


dr = (—Nuvdxdx’)'? , (3.46) 


if the separation is timelike, that is if 7,,,dx"dx” < 0. The proper distance between two 
events as measured by a rigid rod joining them is given by 


db (iucmary (3.47) 


if the separation is spacelike, that is, if n,,,dx“dx” > 0. These results are independent of 
the coordinates used. Then in curved spacetime we have 


dt = (—gyydx"dx”)'/? [timelike] —> g,,,dx"dx” <0, 
dl = (gy,dx"dx”)'/? [spacelike] —> g,,,dx“dx” > 0. (3.48) 


There is a third class of separation dx* between events, those for which, in a local 
Lorentz frame, 


Nyvax"dx” =0. (3.49) 


These are called null or lightlike separations, and pairs of events that satisfy this condition 
are connectible by light rays. It is a tenet of special relativity that light rays move along 
straight, null trajectories, that is, if kK" = dx/do is a tangent vector to a light-ray 
trajectory, then 


dk#/do =0,  tpktk’ =0. (3.50) 


where o is a parameter labeling points along the trajectory. It should not be forgotten, 
however, that this is not a postulate but rather is a consequence of Maxwell’s equations, 
valid only in the “geometrical optics” limit, in which the characteristic wavelength \ ~ 
(k°)—! is small compared to the scale £ over which the amplitude of the wave changes. 
(For example, £ might be the radius of curvature of a spherical wavefront.) 

Since the first of Eqs. (3.50) can be written, in flat spacetime 


dk# [do = (dx” /do)k* , =k’k*, =0, (3.51) 
then EEP yields the equations 
We =0,  SyvkhkY =0, (3.52) 


that is, the trajectories of light rays in the geometrical optics limit are null geodesics. 


3.2.3 Actions and energy-momentum tensors 


Another useful and important form of the equations of motion for matter and nongravita- 
tional fields can be derived in the case where the equations are obtained from a covariant 
action principle. This will essentially always be the case, for the following reason: in 
special relativity, all modern viable theories of nongravitational fields and their interactions 
take an action principle as their starting point, leading to an action /yg. The use of EEP does 
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C Box 3.1 | Maxwell's equations and null geodesics 


It is useful to derive the equation for null geodesics directly from the curved-spacetime form of Maxwell’s 
equations, in order to illustrate the role and the limits of validity of the geometrical-optics assumption. In 
curved spacetime, the geometrical-optics limit requires that A be small compared both to £ and to 7, the 
scale over which the background geometry changes (72 is related to the Riemann curvature tensor), that is, 


A _A 
noe (3.53) 


In this limit, the electromagnetic vector potential can be written in terms of a rapidly varying real phase and 
a slowly varying complex amplitude in the form (see MTW, section 22.5, for details) 


Ay = (dy + by +...)e°/*, (3.54) 


where 0 is a real phase, a,,, 6,,, .. . are complex amplitudes, and ¢ is a formal expansion parameter that 
keeps track of the powers of \/L. Ultimately, one takes only the real part of A,,, in any physical calculations. 
We define the wave vector 


Mis Sig 1S a eh (3.55) 
Then Maxwell's equations in Lorenz gauge, Eqs. (3.43), yield 
1 
eon = Ezc Fa aay = 0) gue, 
G 


Sg el 


1 j 
- hot? (a + eb!) ++ (2k? a" +k? ga") + ae) gare, (3.56) 
(é € 
Setting the coefficients of each power of € equal to zero, we obtain for the leading terms in each equation 
k*a, =0, (3.57) 
n=, (3.58) 


inother words, the amplitude is orthogonal to the wave vector, and the wave vectoris null. Taking the gradient 
of Eq. (3.58) and noting that k,,.., = k,,,,, since k,, is itself a gradient, we get 


kK, =0 (3.59) 


which is the geodesic equation for k"’. The trajectory x“ (c7) of the ray can then be shown to be related to 
k* by the differential equation dx" (a) /do = k(x”), where o is an affine parameter along the ray. For 
discussion of the higher-order terms in Eqs. (3.56), see MTW, section 22.5. 


not alter the fact that the equations of motion are derivable from an action. Consequently, 
one is led in curved spacetime to an action of the general form 


Inc = | bvodts, 


LNG = Lna (qa, DA Suv Syv,B) > (3.60) 
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where q4 and q4,,, are the nongravitational fields under consideration and their first partial 
derivatives (e.g.,u", A,,, A, ,,-..) and g,,, and g,,,,g are the metric and its first derivative. 
(The extension to second and higher derivatives is straightforward). The action principle 
dlc = 0 is covariant, thus, under a coordinate transformation, Cjg must be unchanged in 
functional form, modulo a divergence (see Trautman (1962) for discussion). Consider the 
infinitesimal coordinate transformation 


xt xt 4 gH, (3.61) 


Then the metric changes according to [see, for example, Eq. (3.12)], 


OS pv = —Zuae” yp =Biae ji = Banat” : (3.62) 
Assume the matter and nongravitational field variables change according to 
6q4 = Gaye — ga", (3.63) 


where d!!, are functions of x* and encode the tensorial transformation properties of q4; the 
second term accounts for the spacetime variation of g4. Under this transformation, Lyng 
changes by 


OLNG OLNG ) OLNG OLNG 
Bi i ae hock, “GEA 
> Oy Ola, Bay auree 


Substituting Eqs. (3.62) and (3.63), integrating by parts, dropping divergence terms, and 
demanding that Cg be unchanged for arbitrary functions €° yields the “Bianchi identities” 


dLNG uw LNG 1 Veer 1/2 
t Via mo al i hae =v; 
) ae oe + (i, baa), 58uv.a(—g) OT [gu (=g) | 0 


WV 


(3.65) 


where 0£LyG/6q, is the “variational” derivative of Lyg defined, for any variable ~, by 


dLnG _ OLNG O (OLNG 
= 3.66 
py Op Ox ( Ob ) ; oe 
and 7#” is the “energy-momentum tensor,” defined by 
qe* =2(=¢)-" Exe (3.67) 
Sv 
Using the fact that 
0 a 
ane) Srey as (3.68) 


we can rewrite Eq. (3.65) in the form 


5L 5L 
Tey = (-g) > ac = (4, 5 xe) (3.69) 
A WAS 
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However, the nongravitational field equations and equations of motion are obtained by 
setting the variational derivative of C£xg with respect to each field variable q4 equal to 
zero, that is, 


dLna/dq4 = 0, (3.70) 
which by Eq. (3.69) is equivalent to 
Tov = 0. (3.71) 


Thus, the vanishing of the divergence of the energy-momentum tensor 7“” is a conse- 
quence of the nongravitational equations of motion. This result could also have been 
derived, first working with £yg in a local Lorentz frame to obtain the equation 7% , =0 
by the above method, then using EEP to obtain Eq. (3.71). Notice that Equation “3.65) 
is a consequence purely of universal coupling (EEP) and of the invariance of the 
nongravitational action, and is valid independent of the field equations for the gravitational 
fields. 

The energy-momentum tensor 7“” for charged particles and electromagnetic fields 
may be obtained from the action /yg, Eq. (3.26), by first rewriting it as an integral over 
spacetime in the form 


dct! dx \'/? 
ING = Ym f re) O4 [x — x°(r)|drd*x 


+ pa ex f Aulx ot x°(r)|drd‘x 

Ha GupB 1/2 
— ze | shee! FuFaa(—g)' Pax, (3.72) 
Since only g,,, (and not its derivatives) appears in Jc, we obtain 


1/2 OLNG 


THY = 2(—g)~ 2% 
pV 


1 1 
= = Lima Dowie & be —x,(r)] 4 Fi (Fron — re) , (3.73) 
where we have used the fact that 


1 
eee (3.74) 


3.2.4 Hydrodynamics 


So far, our model for nongravitational physics has been that of point charged particles and 
their associated electromagnetic fields (we have ignored the weak and strong interactions 
of the standard model of particle physics). For some applications, this is a useful model, 
but for most of the applications of this book it is too microscopic. Instead, we will use the 
perfect fluid as our model for matter. This model is a “coarse graining” of the properties 
of matter over scales that are large compared to atomic scales, but small compared to 
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the scales over which the bulk properties of the fluid vary. The matter is assumed to be 
electrically neutral over these scales, so that the direct effects of electromagnetic fields can 
be ignored. Thus, one can speak of density, pressure, and velocity of fluid elements at a 
point within the fluid. A perfect fluid is one that has negligible viscosity, heat transport, and 
shear stresses. It is then possible to show that the energy-momentum tensor for the fluid 
has the following property: in a local Lorentz frame that is momentarily comoving with a 
chosen element of the fluid, the energy-momentum tensor for that element has the form 


TP” — diag[p(1 +11), p, p, pl, (3.75) 


where p is the rest-mass-energy density of atoms in the fluid element, pII is the density 
of internal kinetic and thermal energy in the fluid element (II then is the specific energy 
density), and p is the isotropic pressure, all measured in the locally comoving Lorentz 
frame. This can also be written in the Lorentz invariant form 


TH? — o(1 + T)uMu’ + p(u@u? + n*”), (3.76) 


where u/ = dx! /dr is the four-velocity of the fluid element (= oe in the comoving frame). 
Then in curved spacetime, 7"” has the form 


Tw = (p+ pll + p)utu” + pe”, (3.77) 
and satisfies the equation of energy-momentum conservation 
Pee") (3.78) 


if the nongravitational equations of motion hold. This can also be derived from Eq. (3.73) 
using suitable techniques in relativistic kinetic theory (see Ehlers 1971). 

In our microscopic approach, we treated each particle has having a constant mass mq. 
In coarse-graining to obtain a fluid approximation, we assume that the fluid has a chemical 
composition that is static and homogeneous, and that baryon number is conserved, so that 
we can treat the rest mass-energy of a fluid element as directly proportional to its total 
baryon number. In our fluid, therefore, the number of baryons within a given comoving 
fluid element of volume Y is constant by definition (we obviously are ignoring the effects 
of a possible nonconservation of baryons, which are utterly negligible on the timescales of 
interest). If p is the total mass energy in the fluid element divided by its volume, then we 
have the conservation statement 


d(pV) /dt = 0 (3.79) 


valid in the comoving frame. But because V~'dV/dt = V - v where v is the velocity 
field within the fluid element (v = 0 only at a single fiducial point within the comoving 
element), this conservation statement is equivalent to 


dp Op 

“4 90V-v=0>5> 4V.- =0. 3.80 
merry ae t V: fev) (3.80) 
This is the continuity equation for rest mass-energy, expressed in the locally comoving 
Lorentz frame. The Lorentz invariant version of this equation, valid in any local Lorentz 
frame, is 


0 0 : 
jy (eu) + a5 (ou!) = (ou), n= 0. (3.81) 
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In curved spacetime, this then has the form 
(pul). = 0. (3.82) 


Equations (3.77), (3.78) and (3.82) constitute the fundamental equations of hydrodynamics 
in curved spacetime. To complete the system of equations, one needs to specify an equation 
of state p = p(p, T, Y,...) that relates the pressure to the density p, temperature 7, 
chemical composition Y and other variables. 

Substituting Eq. (3.77) into (3.78) and separating the result into a component along the 
four-velocity u/’ and components orthogonal to it, we obtain 


Du . 7 
(utpP)>— +18 Bt u%u?)\p 2 =0, (3.83a) 


d 
+ (u+p)uis =0, (3.83b) 


where pp = p(1 + II), and Du*/Dr = u’u%.,. The first equation is the curved spacetime 
version of the Euler equation of hydrodynamics, with y + p playing the role of inertial 
mass density, and with the Christoffel symbols embedded within the covariant derivative 
producing the gravitational accelerations. Inserting the continuity equation (3.82) into 
Eq. (3.83b) and recalling that w“.,, = V~'dV/dr (the curved spacetime version of 
V -v=V—'dV/dt), we obtain 

d 


d 
7, (ell) +p-V = 0, (3.84) 


which is nothing but the local first law of thermodynamics, with EF = pITV, namely, 
dE + pdV =0. (3.85) 


But since dE+pdV = TdS in general, where S is the entropy, this implies that the fluid flow 
is isentropic (dS = 0). This property was actually built into the energy-momentum tensor 
from the start by assuming the perfect fluid form. Entropy changes are associated with 
heat transport, and to include such effects we would have had to add an additional piece to 
TH” given by 2u\“q”), where q” is a heat-flux four-vector, along with auxiliary equations 
relating g” to temperature gradients, energy-generation mechanisms, and so on. For further 
discussion of nonperfect fluids, see MTW, section 22.3, Ehlers (1971), and Andersson and 
Comer (2007). 

The law of conservation of rest mass, Eq. (3.82) permits us to introduce an auxiliary 
density variable p*, sometimes called the conserved density, which will prove to be very 
useful throughout this book. By noticing that for any four-vector field A”, 


AP = ly A (3.86) 


Tz 
we can therefore write 
0 O 
= bY. * fats) = 
v—8(0U") n= [V—gput) n= 30" + a Glew) = 9, (3.87) 
where we define 


p’ =p/—en". (3.88) 
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Thus p* satisfies an “Eulerian” continuity equation in any (t,x) coordinate system. This 
“conserved” density is useful because for any function f(t, x) defined in a volume V whose 
boundary is outside the matter 


) * Bo Of B 
sh (ox tax)ex= fp at Xx, (3.89) 


where df/dt = Of/0t+v- Vf. To establish this we apply the continuity equation for p* and 
convert the integral of a divergence into a surface integral that vanishes outside the matter. 
Notice that Eq. (3.89) implies that 


—=0, (3.90) 


where 
m= f otdx= f pu'(-2)'ax= | ov, (3.91) 
V V V 


is the total rest mass energy of the particles in the volume V; VY, = u®(—g)!/?d3x is the 
proper volume element. 
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ee 


To obtain a complete metric theory of gravity one must now specify field equations for 
the metric and for the other possible gravitational fields in the theory. There are two 
alternatives. The first is to assume that these equations, like the nongravitational equations, 
can be derived from an invariant action /g(¢4,¢4,,), which will be a function of the 
gravitational fields 64 (which could include g,,,,) and their derivatives. The complete action 
is thus 


I= Io(a; Psp) air NACHE qA,p> Syv> avg) : (3.92) 
Variation with respect to @,4 yields the gravitational field equations 


dL 4 OS wy OLNG _ 
Oba Oba 68 uv = 


(3.93) 


or, using Eq. (3.67), 


LG 1 1/2 ppv OSpv 
a 5-2) ae Oba (3.94) 
Theories of this type are called Lagrangian-based covariant metric theories of gravity. 
Many important general properties of such theories are described by Lee et al. (1974). 
The other alternative is to specify gravitational field equations that are not derivable from 
an action. These are called non-Lagrangian-based theories. Although many such theories 
have been devised, they have not met with great success in agreeing with experiment. All 
the metric theories to be described in Chapter 5 are Lagrangian based. 
In any metric theory of gravity, matter and nongravitational fields respond only to 
the spacetime metric g. In principle, however, there could exist other gravitational fields 
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besides the metric, such as scalar fields, vector fields, and so on. If matter does not couple 
to these fields what can their role in a gravitation theory be? Their role must be that of 
mediating the method by which matter and nongravitational fields generate gravitational 
fields and produce the metric. Once determined, however, the metric alone interacts with 
the matter as prescribed by EEP. What distinguishes one metric theory from another, 
therefore, is the number and kind of gravitational fields it contains in addition to the 
metric, and the equations that determine the structure and evolution of these fields. From 
this viewpoint, one can divide all metric theories of gravity into two fundamental classes: 
“purely dynamical” and “prior geometric.” (This division is independent of whether or not 
the theory is Lagrangian based.) By “purely dynamical metric theory” we mean any metric 
theory whose gravitational fields have their structure and evolution determined by coupled 
partial differential field equations. In other words, the behavior of each field is influenced 
to some extent by a coupling to at least one of the other fields in the theory. By “prior 
geometric” theory, we mean any metric theory that contains “absolute elements,” fields 
or equations whose structure and evolution are given a priori and are independent of the 
structure and evolution of the other fields of the theory. These “‘absolute elements” could 
include flat background metrics 7, cosmic time coordinates 7, and algebraic relationships 
among otherwise dynamical fields, such as 


Suv = +kyky, (3.95) 


where h,,, and k,, may be dynamical fields. Note that a field may be absolute even if it is 
determined by partial differential equations, as long as the equation does not involve any 
dynamical fields. For instance, a flat background metric is specified by the field equation 
Riem(7) = 0, or a cosmic time function is specified by the field equations V,,V,7=0, 
V,IV"T = —1, where the gradients and inner products are taken with respect to a 
nondynamical background metric, such as 77. 

General relativity is a purely dynamical theory since it contains only one gravitational 
field, the metric itself, and its structure and evolution is governed by partial differential 
equations (Einstein’s equations). Brans-Dicke theory and its generalizations are purely 
dynamical theories; the field equation for the metric involves the scalar field (as well as 
the matter as source), and the field equation for the scalar field involves the metric. In 
Chapter 5, we will discuss these and other theories in more detail. 

By discussing metric theories of gravity from this broad, “Dicke” point of view, it is 
possible to draw some general conclusions about the nature of gravity in different metric 
theories, conclusions that are reminiscent of the Einstein Equivalence Principle, but that 
will be given a new name: the Strong Equivalence Principle. 

Consider a local, freely falling frame in any metric theory of gravity. Let this frame 
be small enough that inhomogeneities in the external gravitational fields can be neglected 
throughout its volume. However, let the frame be large enough to encompass a system 
of gravitating matter and its associated gravitational fields. The system could be a star, 
a black hole, the solar system, or a Cavendish experiment. Call this frame a “quasilocal 
Lorentz frame.” To determine the behavior of the system we must calculate the metric. The 
computation proceeds in two stages. First, we determine the external behavior of the metric 
and gravitational fields, thereby establishing boundary values for the fields generated by the 
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local system, at a boundary of the quasilocal frame “far” from the local system. Second, 
we solve for the fields generated by the local system. But because the metric is coupled 
directly or indirectly to the other fields of the theory, its structure and evolution will be 
influenced by those fields, particularly by the boundary values taken on by those fields 
far from the local system. This will be true even if we work in a coordinate system in 
which the asymptotic form of g,,, in the boundary region between the local system and 
the external world is that of the Minkowski metric. Thus, the gravitational environment 
in which the local gravitating system resides can influence the metric generated by the 
local system via the boundary values of the auxiliary fields. Consequently, the results of 
local gravitational experiments may depend on the location and velocity of the frame 
relative to the external environment Of course, local nongravitational experiments are 
unaffected since the gravitational fields they generate are assumed to be negligible, and 
since those experiments couple only to the metric whose form can always be made locally 
Minkowskian. Local gravitational experiments might include Cavendish experiments, 
measurements of the acceleration of massive bodies, studies of the structure of stars and 
planets, and so on. We can now make several statements about different kinds of metric 
theories (Will and Nordtvedt, 1972). 

(a) A theory that contains only the metric g yields local gravitational physics that is 
independent of the location and velocity of the local system. This follows from the fact 
that the only field coupling the local system to the environment is g, and it is always 
possible to find a coordinate system in which g takes the Minkowski form at the boundary 
between the local system and the external environment. Thus, the asymptotic values of 
Zyv are constants independent of location, and are asymptotically Lorentz invariant, thus 
independent of velocity. General relativity is an example of such a theory. 

(b) A theory that contains the metric g and dynamical scalar fields ¢4 yields local 
gravitational physics that may depend on the location of the frame but that is independent 
of the velocity of the frame. This follows from the asymptotic Lorentz invariance of the 
Minkowski metric and of the scalar fields, except now the asymptotic values of the scalar 
fields may depend on the location of the frame. An example is scalar-tensor theory, where 
the asymptotic scalar field determines the value of the gravitational constant, which can 
thus vary as ¢ varies. 

(c) A theory that contains the metric g and additional dynamical vector or tensor fields or 
prior-geometric fields yields local gravitational physics that may have both location- and 
velocity-dependent effects. This will be true, for example, even if the auxiliary field is a flat 
background metric 7. The background solutions for g and 77 will in general be different, 
and therefore in a frame in which g,,,, takes the asymptotic form diag(—1, 1, 1, 1), nv will 
in general have a form that depends on location and that is not Lorentz invariant (although 
it will still have vanishing curvature). The resulting location and velocity dependence in 
7 will act back on the local gravitational problem. Be reminded that these effects are a 
consequence of the coupling of auxiliary gravitational fields to the metric and to each 
other, not to the matter and nongravitational fields. For metric theories of gravity, only gy, 
couples to the latter. 

These ideas can be summarized in the form of a principle called the Strong Equivalence 
Principle whose statement is given in Box 3.2. 
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The Strong Equivalence Principle (SEP) 


1. WEP is valid for self-gravitating bodies as well as for test bodies. This is known as the Gravitational Weak 
Equivalence Principle (GWEP). 

2. The outcome of any local test experiment is independent of the velocity of the (freely falling) apparatus. 

3. The outcome of any local test experiment is independent of where and when in the universe it is 
performed. 


The distinction between SEP and EEP is the inclusion of bodies with self-gravitational 
interactions (planets, stars) and of experiments involving gravitational forces (Cavendish 
experiments, gravimeter measurements). Note that SEP contains EEP as the special case 
in which local gravitational forces are ignored. 

It is tempting to ask whether the parallel between SEP and EEP extends as far as a 
Schiff-type conjecture; for example, “any theory that embodies GWEP also embodies 
SEP.” As in Section 2.5, we can give a plausibility argument in support of this, for the 
special case of metric theories of gravity with a conservation law for total energy (Haugan, 
1979). Generally speaking, this means Lagrangian-based theories. Consider a local self- 
gravitating system moving slowly in a weak, static, external gravitational potential U(x). 
We assume that the laws governing its motion can be put into a quasi-Newtonian form, 
with the conserved energy E, given by 


1 
E, = Mp [ + A — U(X) + 0(U?, V*, ur?) ; (3.96) 


where Mr = Mo — Ep(X, V) is the rest mass of the self-gravitating system, with Mo the 
total rest-mass of its constituents, and Eg the gravitational binding energy, given by 


— ao 
Ep(X, V) = E9, + bmi U(x) — som V! VP OCF oo.) (3.97) 


(see Section 2.4 for detailed definitions). Here we use units in which the speed of light 
as measured far from the local system is unity. The position and velocity dependence in 
Eg can manifest itself, for example, as position and velocity dependence in the locally 
measured gravitational constant. For two bodies in a local Cavendish experiment separated 
by a distance 7, the local gravitational constant is given by 
2 
G.=——-F. = y _dEx (3.98) 


r > 
mm mim, dr 


and thus the anomalous mass tensors will contribute to G, (see Section 6.6 for an explicit 
calculation in the PppN framework). However, a cyclic gedanken experiment identical to 
that presented in Section 2.4 shows that the anomalous mass tensors 5m} and dm} also 
generate violations of GWEP, 


: dmkt omi* 
Al =o! P ke I k ; 
+(Z)u J Mr gz, (3.99) 
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where g = VU. Hence the validity of GWEP (5m) = dm" = 0) implies no preferred- 
location or preferred-frame effects, thence SEP. In Chapters 4, 5, and 6, we will see specific 
examples of GWEP and SEP in action in the post-Newtonian limits of arbitrary metric 
theories of gravity. 

This discussion of the coupling of auxiliary fields to local gravitating systems indicates 
that if SEP is valid, there must be one and only one gravitational field in the universe, the 
metric g. These arguments are only suggestive however, and it is very unlikely that they 
can be made more rigorous. 

The assumption that there is only one gravitational field is the foundation of many 
so-called derivations of general relativity. One class of derivations uses a quantum- 
field-theoretic approach. One begins with the assumption that, in perturbation theory, 
the gravitational field is associated with the exchange of a single massless particle of 
spin 2 (corresponding to a single second-rank tensor dynamical field), and by making 
certain reasonable assumptions that the S-matrix be Lorentz invariant or that the theory 
be derivable from an action, one can generate the full classical Einstein field equations 
(Weinberg, 1965; Deser, 1970). Another class of derivations attempts to build the most 
general field equation for g out of tensors constructed only from g, subject to certain 
constraints (no higher than second derivatives, for instance). By demanding that the field 
equations should be compatible with the matter equations of motion T“” .,, = 0, one is 
led (except for the possible cosmological constant term) to Einstein’s equations. For a 
review of these and other “derivations” of general relativity the reader is referred to MTW, 
box 17.2. 

However, the implicit use of SEP in all these derivations cannot be emphasized enough. 
Empirically, it has been found that virtually every metric theory other than general relativity 
introduces auxiliary gravitational fields, either dynamical or prior geometric, and thus 
predicts violations of SEP at some level. General relativity seems to be the only metric 
theory that embodies SEP completely. Thus, the wide variety of derivations of general 
relativity that assume SEP, plus evidence from alternative theories lends some credence to 
the conjecture 


SEP => [General Relativity ] . (3.100) 


In Chapter 8 we will discuss experimental evidence for the validity of SEP. 
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We have seen that, despite the possible existence of long-range gravitational fields in 
addition to the metric in various theories of gravity, the postulates of metric theories 
demand that matter and nongravitational fields be completely oblivious to them. The only 
gravitational field that enters the equations of motion is the metric g. The role of the other 
fields that a theory may contain can only be that of helping to generate the spacetime 
curvature associated with the metric. Matter may create these fields, and they plus the 
matter may generate the metric, but they cannot act back directly on the matter. Matter 
responds only to the metric. 

Consequently, the metric and the equations of motion for matter become the primary 
entities for calculating observable effects, and all that distinguishes one metric theory 
from another is the particular way in which matter and possibly other gravitational fields 
generate the metric. 

The comparison of metric theories of gravity with each other and with experiment 
becomes particularly simple when one takes the slow-motion, weak-field limit. This 
approximation, known as the post-Newtonian (PN) limit, is sufficiently accurate to encom- 
pass a wide range of past and future solar-system tests. The post-Newtonian approximation 
as presented in this chapter is intended to be valid within the near-zone of the system, 
corresponding to a sphere smaller than one gravitational wavelength in size. In a somewhat 
different guise, it can also be applied to gravitational waves; this will be addressed in 
Chapter 11. Strictly speaking, the post-Newtonian approximation cannot be applied to 
systems with compact objects, such as black holes or neutron stars, however, with suitable 
modifications, a version of the post-Newtonian approximation can be used to describe 
the orbital motion of compact bodies, provided that the orbital motions are slow and the 
interbody gravitational fields are weak, and that the bodies are sufficiently small compared 
to the orbital separation that tidal interactions can be ignored (see Chapter 10). The post- 
Newtonian approximation is not suited to cosmology where rather different assumptions 
and approximations must be made (see Section 12.4). 

In Section 4.1, we discuss the post-Newtonian limit of metric theories of gravity, and 
devise a general form for the post-Newtonian metric for a system of perfect fluid. This form 
should be obeyed by most metric theories, with the differences from one theory to the next 
occurring only in the numerical coefficients that appear in the metric. When the coordinate 
system is appropriately specialized to a standard “gauge,” and arbitrary parameters are 
used in place of the numerical coefficients, the result, described in Section 4.2 is known 
as the parametrized post-Newtonian (PPN) formalism, and the parameters are called PPN 
parameters. In Section 4.3, we discuss the effect of Lorentz transformations on the PPN 
coordinate system, and show that some theories of gravity may predict gravitational 
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effects that depend on the velocity of the gravitating system relative to a preferred frame, 
presumably attached to the rest of the universe. In Section 4.4, we analyze the existence 
of post-Newtonian integral conservation laws for energy, momentum, angular momentum 
and center-of-mass motion within the PPN formalism, and show that metric theories possess 
such conserved quantities if and only if their PPN parameters obey certain constraints. 

This formalism then provides the framework for a discussion of specific alternative 
metric theories of gravity (Chapter 5), and for the analysis of solar system tests of 
relativistic gravitational effects (Chapters 7 — 9). 


4.1 The Post-Newtonian Approximation 
See ee ee 


4.1.1 Newtonian gravity and the Newtonian limit 


In the solar system, gravitation is weak enough for Newton’s theory of gravity to 
adequately account for all but the most minute effects. This is also true for almost all stars 
(excepting neutron stars, and black holes, of course), all galaxies (except in the vicinity 
of their central massive black holes), and even clusters of galaxies and the large-scale 
structure of the universe. To an accuracy of at least a part in 10°, light rays travel through 
the solar system on straight lines, and test bodies move according to 


a= VU, (4.1) 


where a is the body’s acceleration and U is the Newtonian gravitational potential produced 
by rest-mass density p according to 


V-U = —4np, U(t,x) -[& (4.2) 


Ix — ze 
Here and for the remainder of this book we use units in which the speed of light c is unity, 
and the gravitational constant as measured far from the solar system G is unity. A perfect, 
nonviscous fluid obeys the usual Eulerian equations of hydrodynamics 


Op 
=—t+V- 0 
+ V (pr) =0, 
d 

p= pVU-Np, (4.3) 
where v is the velocity of an element of the fluid, ¢ is the rest-mass density of matter in 
the element, p is the total pressure (matter plus radiation) on the element, and d/dt = 
O/Ot + v- V is the time derivative following the fluid. 

From the standpoint of a metric theory of gravity, Newtonian physics may be viewed 
as a first-order approximation. Consider a test body momentarily at rest in a static 
external gravitational field. From the geodesic equation (3.44), the body’s acceleration 
a! = d’x//df ina static (t,x) coordinate system is given by 


1, 
al = Ty = 58) S004 (4.4) 
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Far from the Newtonian system, we know that, in an appropriately chosen coordinate 
system, the metric must reduce to the Minkowski metric (see Subsection 4.1.3) 

Suv Nv = diag(—1,1,1,1). (4:5) 
In the presence of a very weak gravitational field, Eq. (4.4) can yield Newtonian 
gravitation, Eq. (4.1) only if 

gw 6, go) =-14+2U. (4.6) 


It is straightforward to show that, with this approximation and an energy-momentum tensor 
for perfect fluids given by 


TM =p, M=pv, T*= pik +pd*, (4.7) 
the Eulerian equations of Motion 4.3 are equivalent to 
ee eee T =O. (4.8) 


where we retain only terms of lowest order in v? ~ U ~ p/p. 

But the Newtonian limit no longer suffices when we begin to demand accuracies greater 
than a part in 10° in the solar system. For example, it cannot account for Mercury’s 
anomalous perihelion advance of ~5 x 10~’ radians per orbit. Thus we need a more 
accurate approximation to the spacetime metric that goes beyond, or “post” Newtonian 
theory, hence the name post-Newtonian limit. 


4.1.2 Post-Newtonian bookkeeping 


The key features of the post-Newtonian limit can be better understood if we first develop 
a “bookkeeping” system for keeping track of “small quantities.” In the solar system, the 
Newtonian gravitational potential U is nowhere larger than 10~° (in geometrized units, U, 
or more precisely, GU/c?, is dimensionless). Planetary velocities are related to U by virial 
relations which yield 


vw. (4.9) 
The matter making up the Sun and planets is under pressure p, but because the bodies are in 


hydrostatic equilibrium, this pressure is comparable to pU (more precisely Vp = pVU), 
in other words 


p/p~U. (4.10) 


From thermodynamics, other forms of energy (compressional energy, radiation, thermal 
energy, magnetic energy, etc.) are related to pressure by pV ~ E ~ pllV, and thus II, the 
ratio of energy density to rest-mass density, is related to U by 


Raia, (4.11) 


These four dimensionless quantities are assigned a bookkeeping label € that denotes their 
“order of smallness”: 


Unvwvp/p~ll~ Oe). (4.12) 
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Then single powers of velocity v are O(e!/?), U? is O(e?), Uv is O(e€/2), UTI is O(e?), 
and so on. Also, since the time evolution of the system is governed by the motions of its 
constituents, we have that 


0/Ot~v-V, (4.13) 
and thus that 
|0/0t| 1/2 
1 /ax| O(e'/"). (4.14) 


We can now analyze the “post-Newtonian” metric using this bookkeeping system. The 
action, Eq. (3.26), from which one can derive the geodesic equation (3.44) for a single 
neutral particle, may be rewritten, after dropping the electromagnetic terms, 


dr dr’ a 
Inc = lo = mo | ( Suv “) dt 


=—mo 200 — 2go;v! — giv’ at (4.15) 
) »/ 


where r = (t,r) describes the particle’s position in spacetime, and vw’ = dr//dt is its 
three-dimensional velocity vector. The integrand in Eq. (4.15) may thus be viewed as the 
Lagrangian L for a single particle in a metric gravitational field. From Eq. (4.6), we see 
that the Newtonian limit corresponds to 


1 
L=—m)(1—2U—v’)? ~ —mp + 5mov" +moU, (4.16) 


as can be verified using the Euler-Lagrange equations. In other words, Newtonian physics 
is given by an approximation for L correct to O(€). Post-Newtonian physics must therefore 
involve those terms in L of the next order, O(c). 

But what about odd-half-order terms, O(e!/2) or O(e*/2). Odd-half-order terms must 
contain an odd number of velocities v or of time derivatives 0/0t. Since these factors 
change sign under time reversal, odd-half-order terms must represent dissipative processes 
in the system. But, roughly speaking, conservation of rest-mass energy prevents terms of 
O(e!/?) from appearing in L, and conservation of energy in the Newtonian limit prevents 
terms of O(e*/*). Beyond O(e?), different theories may make different predictions. In 
general relativity, for example, the conservation of post-Newtonian energy prevents terms 
of O(€5/?). However, terms of O(e7/*) can appear; they represent the dissipative effects of 
gravitational radiation reaction. 

In order to express L to O(e*), we must know the various metric components to an 
appropriate order: 


_\ 1/2 
i= (1 ~2U — ¥ = dgq9[0(2)] — 2gy[0(2/2)]/ — 5gulO(©)|vv*) . (4.17) 
Thus the post-Newtonian limit of any metric theory of gravity requires a knowledge of 


goo to Ole), 
gy to O(e/), 
gk to O(e). (4.18) 
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The post-Newtonian propagation of light rays may also be obtained using the above 
approximations to the metric. Since light moves along null trajectories (dt = 0), the 
Lagrangian L must be formally identical to zero. In the first-order “Newtonian” limit this 
implies that light must move on straight lines at speed 1, that is, 


0=L=(1-v)'?, Y=1. (4.19) 


In the next, post-Newtonian order, we must have 


1/2 


0=L=(1—2U—Vv — dg%{O(c)]v’v') (4.20) 


Thus to obtain post-Newtonian corrections to the propagation of light rays, we need 
to know 


£00 to Ole), 
Se to Ole). (4.21) 


In a similar manner, we can verify that if we take the perfect-fluid energy-momentum 
tensor 


THY’ = (p+ pll+p)u"u” + pg”, (4.22) 


expanded through the following orders of accuracy, 


T° to pO(e), 
TY to pO(e’?), 
T* to pO(e), (4.23) 


and combined with the post-Newtonian metric, then the equation of motion 7””.,, = 0 will 
yield consistent “post-Eulerian” equations of hydrodynamics. 


4.1.3 Isolated post-Newtonian systems 


We must now specify the coordinate system to be used for the post-Newtonian limit. We 
imagine an isolated post-Newtonian system residing in a homogeneous, isotropic universe. 
We choose a coordinate system whose outer region far from the isolated system is in free 
fall with respect to the surrounding cosmological model, and is at rest and nonrotating 
with respect to a frame in which the universe appears isotropic (universe rest frame). In 
this outer region, we expect the physical metric to have the form 
2 
ds? = —df? + (2) Syd! dx + Nyydxtdx” , (4.24) 
0 

where the first two terms comprise the standard spatially flat Friedmann-Robertson-Walker 
(FRW) line element of the current standard model of big-bang cosmology, and the final 
term represents the perturbation due to the local system, where a(t) is the cosmological 
scale factor (ao is its present value). Expanding a(t) about the present time, we obtain 


a(t) 


1 
=1+Apt— Hag ++... (4.25) 
ao 2 
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where Hp = do/do and qo = —Hp > Gq /ay are the present values of the Hubble parameter 
and deceleration parameter. 
We make the coordinate transformation 


t= t +A: (t,x"") + HA, (t,x), 
xf = x) + HyBi (¢ x") + HBBi(7,x"), (4.26) 


where we use Ho as an effective expansion parameter. Substituting into Eq. (4.24), and 
choosing 4, and Bi to eliminate all terms linear in Ho, we find, after dropping contributions 
corresponding to trivial boosts, rotations and displacements, that 4; = —r/2 and Bi = 
—x"t', and that to second order in Hp, the cosmological part of the metric takes the form 


Bo = —1— Hi} (242-77) , 
gh, = HB (83 = xi) 
Sin = Sie +H (283, — ¢754(2 + qo) — 17 6x — oa") ’ (4.27) 


Choosing BS = 4#?x!/(2 + qo) + 42x" and Ay = 4(1 + qo)r??’, we can eliminate the // 
dependence from g/,, set gj; = 0, and obtain finally 


Zo = 1 - Ropogx?x’4 ; 
20 =0, 
1 
Six = Oik _ 3 inkgX x4 7 (4.28) 


where, for the FRW metric, Ropog = HeqoSpqs Rojpg = 0, and Ripkg = He (55 pq — djqdkp): 
These are called Fermi normal coordinates for describing a local freely falling frame (PW, 
section 5.2.5). Then, including the contribution from the local system, we find the line 
element in the form 


/ / 1 
ds® = (nuivt + hyryr)dx dx” — HBr? aoa! ~ a (do? + sin?o'do?)| . (4.29) 


Normally, in finding the metric for an isolated system, we want to impose the boundary 
condition that the metric tend to the Minkowski metric 77,,, at infinity, however because 
of the cosmological background, such a condition will generate an error that grows with 
distance from the system. But because Hp ' S103 km this will happen very far away. To 
estimate where and how large the error might be, we note that a post-Newtonian potential 
of order em/r will be comparable to Hi? at a distance 


em 1 € m Ma 
~(S) ~3x 102k — 
ro (=) 3 x 103 km [+ ”) ; (4.30) 


and at ro, the error being made in imposing the normal boundary condition is of order 
(roHo)? ~ 5 x 10~7° for © ~ 107° and m ~ mo. This is far smaller than the error 
~ & ~ 107)? made by ignoring 2PN effects within the system itself. 

Thus for all solar-system or astrophysical applications of the post-Newtonian approxi- 
mation, we are justified in ignoring the surrounding cosmological setting when establishing 
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asymptotic boundary conditions on the metric. However, if the theory in question has 
auxiliary fields, the cosmological boundary conditions on those fields may be crucial (see 
Section 3.3 for further discussion). 

We will call the coordinate system thus constructed “local quasi-Cartesian coordinates.” 
In this coordinate system it is useful to define the following conventions and quantities: 


1. Unless otherwise noted, spatial vectors are treated as Cartesian vectors, with x* = x,. 
2. Repeated spatial indices or the symbol |x| denotes a Cartesian inner product, for 
example 
xe Sey HP SB eh ae by ee. 


3. The volume element d°x = dxdydz. 


4.2 Building the ppn Formalism 
a ———— SS 


4.2.1 Post-Newtonian potentials 


We will assume throughout that the matter composing the system can be idealized as 
perfect fluid. For the purposes of most solar system or astrophysical situations, this is 
an adequate assumption. As we will see in more detail in Chapter 5, the post-Newtonian 
limit for a system of perfect fluid in any metric theory of gravity is best calculated by 
solving the field equations formally, expressing the metric as a sequence of post-Newtonian 
functionals of the matter variables, with possible coefficients that may depend on the 
matching conditions between the local system and the surrounding cosmological model 
and on other constants of the theory. The evolution of the matter variables, and thence of the 
metric functionals, is determined by means of the equations of motion 7“”.,, = 0 using the 
matter energy-momentum tensor and the post-Newtonian metric, all evaluated to an order 
consistent with the post-Newtonian approximation. The evolution of the cosmological 
matching coefficients is determined by a solution of the appropriate cosmological model. 
Thus, the most general post-Newtonian metric can be found by simply writing down 
metric terms composed of all possible post-Newtonian functionals of matter variables, 
each multiplied by an arbitrary coefficient that may depend on the cosmological matching 
conditions and on other constants, and adding these terms to the Minkowski metric to 
obtain the physical metric. Unfortunately, there is an infinite number of such functionals, 
so that in order to obtain a formalism that is both useful and manageable, we must impose 
some restrictions on the possible terms to be considered, guided in part by a subjective 
notion of “reasonableness” and in part by evidence obtained from known gravitation 
theories. Some of these restrictions are obvious: 


1. The metric terms should be of Newtonian or post-Newtonian order; no post-post- 
Newtonian or higher terms are included. 

2. The terms should tend to zero as the distance |x — x’| between the field point x and 
a typical point x’ inside the matter becomes large. This will guarantee that the metric 
becomes asymptotically Minkowskian in our quasi-Cartesian coordinate system. 
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3. The coordinates are chosen so that the metric is dimensionless. 

4. In our chosen quasi-Cartesian coordinate system, the spatial origin and initial moment 
of time are completely arbitrary, so the metric should contain no explicit reference 
to these quantities. This is guaranteed by using functionals in which the field point x 
always occurs in the combination x — x’, where x’ is a point associated with the matter 
distribution, and by making all time dependence in the metric terms implicit via the 
evolution of the matter variables and of the possible cosmological matching parameters. 

5. The metric corrections hoo, Ag; and Aj, should transform under spatial rotations as 
a scalar, vector, and tensor, respectively, and thus should be constructed out of the 
appropriate quantities. For variables associated with the matter distribution, examples 
are: scalars, p, |x — x’|, v?, v’ - (x — x’), etc.; vectors, v’/, (x — x’)/; and tensors, 
(x — x’)/(x — x')*, v’v'*, etc. For variables associated with the structure of the field 
equations of the theory or with the cosmological matching conditions, there are only 
two available quantities in the rest frame of an isotropic universe: scalar cosmological 
matching parameters or numerical coefficients; and a tensor, 6/*. In the rest frame of an 
isotropic universe, no vectors or anisotropic tensors can be constructed. 

6. The metric functionals should be generated by rest mass, energy, pressure, and velocity, 
not by their gradients. This restriction is purely subjective, and can be relaxed quite 
easily if there is ever any reason to do so. No reason has yet arisen. 

7. A final constraint is extremely subjective: The functionals should be “simple.” 


With those restrictions in mind, we can now write down possible terms that may appear 
in the post-Newtonian metric. 


(1) goo to O(e): We have already determined that go) = —1 + 2U + O(c”), where U 
is the Newtonian potential. However, we now define a new “Newtonian” potential U 
using the conserved density p* defined in Section 3.2.4, instead of the locally measured 
density p, in other words, henceforth, 


U(t, ) BY VU = —4rp*. 431 
x) =f x] ; Tp (4.31) 


We will use p* to define all post-Newtonian potentials. 
(2) gj to O(€): From condition (5), g;, must behave as a three-dimensional tensor potential 
under rotations, thus the only terms that can appear are 


gnlO(e)]: Udbn, Ujx, (4.32) 
where Uj is given by 
Untex) = f o*(ex/) 


The potential Uj, can be expressed more conveniently in terms of the “superpotential” 
X(t,x), given by 


(x= x')i(e —x')e 


3 
eoep (4.33) 


Xx) = if plex \\e— xe x. (4.34a) 


VX = 2U, (4.34b) 
X jk = Udje = Ux. (4.34c) 
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Thus, the only terms that we will consider in gj, are 
glO(©)]: Udit, Xjx- (4.35) 


(3) go; to O(e/?): These metric components must transform as three-vectors under 
rotations, and thus contain only the terms 


gilO(e)]: Vi, Wj, (4.36) 


where 


m= fox) ey, (4.37) 


The potentials V; and W; are also related to the superpotential XY by 
Xy =W)- Vj, (4.38) 
so that the only terms that we will consider here are 
glO(e?)]: Vi, Xoy- (4.39) 


(4) go0 to O(e*): This component should be a scalar under rotations. The only terms that 
we will consider are 


200 [O(€?)] : ‘dep 1, ®., ®3, Pa, ®s, Be, Oy, (4.40) 
where 
ri 
®; =| ee dx! , V78, = —4rp*v, 
«Ut 
7) =/ £ , dx, V7, = —4rp*U, 
*/TT! 
®; =| £ af ae V’0; = —4rp*II, 
p' 
i= | paw x V8, = —4ap, 
/ 
®. = le VAN OU (x x’) 2 e 
; / 7 e— | 
_ 7 wl == PF 25 
w= | o reer dx, 
ne x —x’ ‘x! — x" xx! 
oy= [fe ‘p " et (iI 2 = | exes", (4.41) 


Condition (7) has been used liberally to eliminate otherwise possible metric potentials 
such as V;V;U~!, ®;®3U~*, or U;U;. Should one of these terms ever appear in the 
post-Newtonian metric of a plausible gravitational theory, the formalism could be 
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modified accordingly. The potentials that we have introduced satisfy a number of useful 
relationships, including 


Vi; = Up, (4.42a) 

Xoo = B, + 28, — Bs — Be, (4.42b) 
a] 

by =U VUE f Viney. (4.42c) 
x—x 


The first two relationships may be derived by making use of the following identity, 
obtained using the equation of continuity for the conserved density p*, 


5 fo ex sx) = fo ex Wife a. (4.43) 


The third may be obtained by splitting x’ — x” into (x’ — x) + (x —x”) and x — x” into 
(x — x’) + (x’ — x”) in the numerators of ®, and identifying the four resulting terms. 


4.2.2 The standard ppN gauge 
We can restrict the form of the post-Newtonian metric somewhat by making use of 


the arbitrariness of coordinates embodied in statement (ii) of the Dicke framework. An 
infinitesimal coordinate or “gauge” transformation, 


x# =x + E(x”), (4.44) 
changes the metric to 
sol) = OF x” 
ene ~ Axe HBS 
= Suv (x7) _ Ev = fu 7 O(é’) . (4.45) 


We wish to retain the post-Newtonian character of g,,, and the quasi-Cartesian character 
of the coordinate system, and to remain in the universe rest frame, thus the functions €“ 
must satisfy: (i) €,,, +&,, are post-Newtonian functions; (ii) €,. +), — 0 far from the 
system; and (iii) |€“|/|x""| + 0 far from the system. The only “simple” functional that has 
this property is the gradient of the superpotential X,,,. Thus, we choose 


& = rAXo, & = A2X;, (4.46) 
and obtain, to 1PN order, 


200 = 800 — 2A1X 00, 
Boz = Boj — (Ar + A2)X gy, 
S5e = Sik — 2X2X jx . (4.47) 


In addition to the changes in the metric components induced by the coordinate change, 
we must also express the potentials that appear in the original metric g,,,, in terms of the 
new coordinates. Since the coordinate changes are of post-Newtonian order, they will only 
make a contribution to the post-Newtonian metric via the Newtonian potential U in goo. We 
recall that p*d*x is an invariant, thus p* (t,x)d°x = p* (,x)d°x, and thus we can write that 
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U(t,x) = 1 P(X’) ay 


[x — >| 
ae (F xs 
7 / = soa VR" eer) 
= / Hoey = do fo" ( VX —V'k ) Vv wot (4.48) 
Using Eq. (4.42c), we find that 
U(t,x) = U(é,X) + \x(W? + 6, + Sy), (4.49) 


and recalling Eq. (4.42b), we obtain finally, after dropping all bars on the right-hand-side, 


B90 = Soo — 2A1(B1 + 204 — Bs — Be) + 20(U? + & + Gy), 


Sr = Sik — 2X jx. (4.50) 


By an appropriate choice of A; and A2, we can eliminate certain terms from the post- 
Newtonian metric. We will thus adopt a standard post-Newtonian gauge, in which the 
spatial part of the metric is diagonal and isotropic (i.e., Xj; is eliminated) and in which gq 
contains no term ®s. There is no physical significance in this gauge choice; it is purely a 
matter of convenience (see Section 4.5 for other possible gauge choices). 

We now have a very general form for the post-Newtonian perfect-fluid metric in any 
metric theory of gravity, expressed in a local, quasi-cartesian coordinate system at rest with 
respect to the universe rest frame, and in a standard gauge. The only way that the metric of 
any one theory can differ from that of any other theory is in the coefficients that multiply 
each term in the metric. By replacing each coefficient by an arbitrary parameter we obtain 
a “super post-Newtonian metric theory of gravity” whose special cases (particular values 
of the parameters) are the post-Newtonian metrics of particular theories of gravity. This 
“super metric” is called the parametrized post-Newtonian (PPN) metric, and the parameters 
are called PPN parameters. 

This use of parameters to describe the post-Newtonian limit of metric theories of gravity 
is called the parametrized post-Newtonian (PPN) formalism. Primitive versions of such a 
formalism were devised and studied by Eddington (1922), Robertson (1962) and Schiff 
(1967). This Eddington-Robertson-Schiff formalism treated the solar system metric as that 
of a static, spherical, nonrotating Sun, and idealized the planets as test bodies moving on 
geodesics of this metric. The metric in this version of the formalism reads 


goo = —1+2M/r—28(M/r)’, 

gy = 9, 

Sik = (1+ 29M /r) dix.» (4.51) 
where M is the mass of the Sun, and { and ¥ are PPN parameters. 


These two parameters may be given a physical interpretation. The parameter 7 measures 
the amount of curvature of space produced by a body of mass & at radius 7, in the sense 
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that the spatial components of the Riemann curvature tensor are given to PN order by [see 
Eqs. (3.20) and (3.40)] 


M/.. ae ee oo 2 2 
Ria = 395 (jimsu + ANS — NNO — AyhOix 3 1k + =uby) ; (4.52) 


where # = x/r, independent of the choice of post-Newtonian gauge. The parameter 6 
is said to measure the amount of nonlinearity (M/r)* that a given theory puts into the 
Zoo component of the metric. However, this statement is valid only in the standard post- 
Newtonian gauge. The coefficient of U? = (M/r)* depends upon the choice of gauge, as 
can be seen from Eq. (4.50). In general relativity, for example (8 = y = 1), the (M/r)* 
term can be completely eliminated from goo by a gauge transformation that happens to be 
the post-Newtonian limit of the exact coordinate transformation from isotropic coordinates 
to Schwarzschild coordinates for the Schwarzschild geometry. Thus, this identification of 
8 should be viewed only as a heuristic one. 

Schiff (1967) generalized the metric of Eq. (4.51) to incorporate rotation (Lense-Thirring 
effect, Section 9.1), and Baierlein (1967) developed a primitive perfect fluid PPN metric. 
But the pioneering development of the full PPN formalism was initiated by Kenneth 
Nordtvedt Jr. (1968b), who studied the post-Newtonian metric of a system of gravitating 
point masses. Will (1971c) generalized the formalism to incorporate matter described by 
a perfect fluid. A unified version of the PPN formalism was then presented by Will and 
Nordtvedt (1972). The Whitehead potential ®j; was added by Will (1973). The version 
presented here is the canonical version, with conventions chosen to be in accord with 
standard textbooks such as MTW and PW. 

As in the Eddington-Robertson-Schiff version of the PPN formalism, we introduce an 
arbitrary PPN parameter in front of each post-Newtonian term in the metric. Ten parameters 
are needed; they are denoted ¥, 3, €, a1, a2, a3, G1, G, G, and C4. In terms of them, the 
PPN metric reads 


go0 = -1+2U+2(¥ — BU’) + O(e), (4.53a) 
20) = —5 [40 +7) +a] - sl! +a — 6 + 2€]X oj + O(e”), (4.53b) 
ge = (1+27U) 54+ O0(e), (4.53c) 
where 
Y= Qyt 1 bast G ~ 2608) —(28-1- Gr + (14.G)8s 


L (By +364 — 26) 4 — 5(Gr — 26) — Ey (4.54) 


Although we have used linear combinations of PPN parameters in Eqs. (4.53) and (4.54), 
it can be seen quite easily that a given set of numerical coefficients for the post-Newtonian 
terms will yield a unique set of values for the parameters. The linear combinations were 
chosen in such a way that the parameters a, and ¢, will have special physical significance. 

Other versions of the PPN formalism have been developed to deal with point masses with 
charge (Will, 1976b), fluid with anisotropic stresses (MTW, section 39), and bodies with 
strong internal gravity (Nordtvedt (1985) and Section 10.3). Extensions of the formalism 
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to post-post-Newtonian (2PN) order have been developed for light propagation (Epstein 
and Shapiro, 1980; Fischbach and Freeman, 1980; Richter and Matzner, 1982), for 
motion in spherical symmetry (Sarmiento, 1982), and for N-body motion governed by 
a Lagrangian (Nordtvedt, 1987; Benacquista and Nordtvedt, 1988; Benacquista, 1992). 
Additional parameters or potentials are needed to deal with some theories, such as theories 
with massive fields, where Yukawa-type potentials replace Poisson potentials (see Zaglauer 
(1990) and Helbig (1991) for the case of massive scalar-tensor theory), or theories like 
Chern-Simons theory, which permit parity violation in gravity (Section 5.6). 


4.3 Lorentz Transformations and the ppn Metric 


In Section 4.1.3, the PPN metric was devised in a coordinate system whose outer regions 
were assumed to be at rest with respect to the universe rest frame. For some purposes—for 
example, the computation of the post-Newtonian metric in a given theory of gravity—this 
is auseful coordinate system. But for other purposes, such as the computation of observable 
post-Newtonian effects in systems, such as the solar system, that are in motion relative to 
the universal rest frame, it is not a convenient coordinate system. In such cases, a better 
coordinate system would be one in which the center of mass of the system under study is 
approximately at rest. Again, this is a matter of convenience; the results of experiments 
cannot be affected by our choice of coordinate system. Because many of our computations 
will be carried out for such moving systems, it is useful to reexpress the PPN metric in a 
moving coordinate system. This will also yield some insight into the significance of the 
PPN parameters a, Q2, and a3 (Will, 1971d). 

To do this we make a Lorentz transformation from the original PPpN frame to a new 
frame which moves at velocity w relative to the old frame. In order to preserve the post- 
Newtonian character of the metric, we assume that w = |w| is small, that is, of O(e!/*). 
This transformation from rest coordinates x* = (t,x) to moving coordinates €” = (7, &) 
can be expanded in powers of w to the required order: this approximate form of the Lorentz 
transformation is sometimes called a post-Galilean transformation (Chandrasekhar and 
Contopoulos, 1967), and has the form 


x= €+(1+ Ww?) wrt 3€ ww Ex 02), 


1 1 
t=r(1l+-=w 4 -, t(l+-w)é-w4+7x O(e), (4.55) 
2 8 2 
where wr is assumed to be O(e°). 
We use the standard transformation law 
Oe” ON" 
V(T,§) = aa, Bap BaBlh*), 4.56 
Su (r g) raya aer® a( x) ( ) 


and express the potentials that appear in g.g(t, x) in terms of the new coordinates. Since 
p, Il, and p are all measured in comoving local Lorentz frames, they are unchanged by the 
transformation, as is the quantity p*d*x. Thus, for any given element of fluid, 
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p(t, x) = p(T, ve 
I(t,x) = I(r, €), 
P(t.x) = p(T,€), 
p* (t.x)@x = p* (7, €)@E. (4.57) 


If v(¢,x) and v(7, €) are the matter velocity fields in the two frames, they are related by 
v=viwtoe”). (4.58) 


The quantity x(t) — x’(t) that appears in the post-Newtonian potentials transforms 
according to 


O=(r=1)(14 57) +16) C1) -w4 702). 459) 


But in the (7, €) coordinates, the quantity € — €’ must be evaluated at the common time 7, 
hence we must use the fact that 


él(r') = E (7) + "(7 — 7) + O(7' — 7)’. (4.60) 


Combining Eqs. (4.59) and (4.60), we obtain 


1 _ 1 1 ~/\2 ~/ Poa 2 
kx] €-€) [1+ 50-8) +(w-n')(v' -n’’') + Ole )}. (4.61) 


where f’ = (€ — €')/|€ — €’|. Using Eqs. (4.57), (4.58), and (4.61), along with the 
definitions (4.31), (4.33), (4.34a), (4.37), and (4.41), we then find that 


U(t,x) = U(T, €) + w!W;(T, €) + seit Uul 8) + O(e*) ; 
(t,x) = (7, €) + 2w/V{(7,€) + wWU(7, €) + O(€), 
Do(t,x) = Bo (7, €) + 2w/W)(r, €) + w/wUp (7, €) + O(€), 
Vj(t,x) = Vi(r,€) + wU(r,€) + O18”), 
Xo(tx) = Xoj(7,€) — WX yr, €) + O18), (4.62) 


where 9 in Xo;(7,€) denotes 0/07. The post-Newtonian potentials ), ®3, 4, ®; and 
® yw are unchanged to this order. Applying the post-Galilean transformation Eqs. (4.55) and 
(4.56) to the PPN metric, Eqs. (4.53) and (4.54), and making use of Eqs. (4.62), we obtain 
for the metric in the moving (r, €) frame, to PN order, 


goo(T, €) =— 1 + 2U(r, €) + 2((r, €) — BU(r, €)’) 
— (a, — a3)w* U(r, €) + aw! w EX aT, ) + (203 — a1 )w/V;(7, §) 
+ (1 — ay — G + 26)w*X (7, €) + O(e?) 


5) 
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9 =— 54a + 7) + a ]Vj(7,€) - sll + a2 — C1 + 2€]Xoj(7, €) 
_ sanwiU(7, 8) + awX (7, €) 
+ 5(1 02-6 +26)! Xn(r,€) + OE”), 
Six = (1 + 27U(r, €)) Se + O(€), (4.63) 
where 
(7, €) = S27 +1 +03 +0 — 26)81(7,8) — 28 -1- G - Hhr(7,€) 


2 
+ (1+ Ga) O3(7,€) + By +3G — 26) Ga(7,8) 


— 5 (Gi, = 26)B6(.€) — EBy(7.8). (4.64) 


Because we now have available an additional post-Newtonian variable, w, we have an 
additional gauge freedom that can be employed without altering the standard PPN gauge. 
This gauge was selected in the frame in which w = 0, and was not affected by the post- 
Galilean transformation (spatial metric diagonal, no ®5 in goo). By making the additional 
transformation 

i (I —a.-G + 2€)w*X,, 

=e), (4.65) 
we can eliminate the term (1 — ay — ¢, + 2€)w*X y from goo and the term (1/2) 
(l-ag—Q + 26) WX jx from goj. 

This then becomes the standard PPN gauge in a coordinate system moving at velocity w 
relative to the universal rest frame, namely that gauge in which gj, is diagonal and isotropic, 
and in which the terms ®;5 and WX, og are absent from goo. It is then straightforward to 
show that a further post-Galilean transformation with velocity u (plus a possible gauge 
transformation to maintain the standard gauge) does not alter the form of the PPN metric, 
it merely changes the coordinate system velocity w that appears there to w + u. At 
first glance, one might be disturbed by the presence of metric terms that depend on the 
coordinate system’s velocity w relative to the universal rest frame. These terms do not 
violate the principles of special relativity since they are purely gravitational terms, while 
special relativity is valid only when the effects of gravitation can be ignored; but they do 
suggest that the gravitation generated by matter may be affected by its motion relative to 
the universe (violation of the Strong Equivalence Principle). Nevertheless, the results of 
physical measurements must not depend on the velocity w (this is a consequence of general 
covariance). For a system such as the Sun and planets, the only physically measurable 
velocities are the velocities of elements of matter relative to each other and to the center 
of mass of the system, and the velocity wo of the center of mass relative to the universal 
rest frame (as measured for example by studying the dipole anisotropy in the cosmic 
microwave radiation induced by Doppler shifts). Thus, the PPN prediction for any physical 
effect can depend only on these relative velocities and on wo, never on w. Therefore, the 
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terms in the PPN metric that depend on w must signal the presence of effects that depend 
on Wo. This can be seen most simply by working in a coordinate system in which the 
system under study is at rest, i.e., where w = wo. Then, if any one of the set of parameters 
{a 1, Q2, a3} is nonzero, there may be observable effects which depend on wo. On the 
other hand, if aj = ay = a3 = 0, there is no reference to w or wo in the metric in any 
coordinate system, and no such effects can occur. Thus, we see that the parameters a1, a2 
and a3 measure the extent and manner in which motion relative to the universal rest frame 
affects the post-Newtonian metric and produces observable effects. These parameters are 
called “preferred-frame parameters” since they measure the size of post-Newtonian effects 
produced by motion relative to the “preferred” rest frame of the universe. If all three 
are zero, no such effects are present, and there is no preferred frame (to post-Newtonian 
order). 

Notice that even if one works in the universal rest frame, where w = 0, physical effects 
will be unchanged, for even though the explicit preferred-frame terms are absent, the 
velocities of elements of matter y that appear in the PPN metric and in the equations of 
motion must be decomposed according to 


v=wWoty, (4.66) 


where v is the velocity of each element relative to the center of mass, and, unless ay, 
Q2, and a3 are all zero, the same effects dependent upon wo will result. At this point the 
PPN metric has taken on its standard form. Box 4.1 summarizes the basic definitions and 
formulae that enter the pPN formalism. 


The parametrized post-Newtonian formalism 


A. Coordinate system: A quasi-local Lorentz coordinate system (Section 4.1.3) in which the coordinates are 
(t,X1,X2,X3). Three-dimensional Euclidean vector notation is used. Coordinates are specialized to the 
standard pPN gauge (Section 4.2.2). 

B. Matter variables: 


1. ¢ = density of rest mass of a fluid element as measured in a local, freely falling, momentarily 
comoving frame. 

2. u# = four-velocity of the momentarily comoving frame, with v’ = u/ /u° = coordinate velocity of 
the frame (and thus of the fluid element). 

3. p* = pu’\/—g = “conserved” or baryonic mass density. Satisfies the exact continuity equation 
pat V - (p*v) =0. 

4. p =pressure as measured in a local, freely falling, momentarily comoving frame. 

5. I] = internal energy per unit rest mass. It includes all forms of non-rest-mass, nongravitational 
energy, such as thermal energy, magnetic energy, etc. 

6. w = coordinate velocity of the ppN coordinate system relative to the mean rest frame of the universe. 


C. PPN parameters: 


on B, &, 4, 2, 23, Gh On, iG Gio 
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D. Metric: 
gon = —1+ 20 + 2(h — BU?) + FF, 
1 1 
9 = 20 y+ jay sli Fay — C+ 26 Ky + Oi, 
Ge = (1 + 2-W) Op, 
where 


t (1+ ¢3)@3 + (3 + 3C — 2) By (G 2€) Bs — Eby. 


fe y/2 I = 
®, = = j ee ®, =| p " (x r * | dy’, 
=i, G Bee & 
aif! ele 
®, mm p 3 fe ®; ins, p dbx is 
[es — x"| 2 | — 
Dey an (<a x) Ga) a pay 
ty f oo ae [poet pare 


F. Preferred-frame potentials: 
pr = (a3 cam a) w’U ali (203 a ay)wiV; of aww'X x, 
1 
or = — soul ar anw'X jp 
G. Energy-momentum tensor (perfect fluid): 


1 
r= [1 + T+ -v + (2 au] 


2 


1 
TY = p*vi f + IT + A + (2—3y)U 4 slo") 
1 
TH = p* vik [ + T+ aN + (2 — 3y)U 4 ole" | + po*(1 — 2), 
H. Equations of motion: 
1. Hydrodynamics: 
ay =0, 
2. Test bodies 
ye ee dn = 
Aa Ee aye yy 


3. Maxwell's equations 
Yn Sel, IF 
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4.4 Global Conservation Laws 
————_—>SSS=S=aBE— SS Se SSS 


Conservation laws in Newtonian gravitation theory are familiar: for isolated gravitating 
systems, mass is conserved, energy is conserved, linear and angular momenta are con- 
served, and the center of mass of the system moves uniformly. This does not apply to every 
metric theory of gravity, however. Some theories violate some of these conservation laws 
at the post-Newtonian level, and it is the purpose of this section to explore such violations 
using the PPN formalism. 

One can distinguish two kinds of conservation laws: local and global. Local conservation 
laws are laws that are valid in any local Lorentz frame, and are independent of the metric 
theory of gravity. They depend rather, upon the structure of matter that one assumes. They 
include local baryon number conservation and the local conservation of energy or the first 
law of thermodynamics. These have been discussed in Chapter 3. 

Global conservation laws, however, are statements about gravitating systems in asymp- 
totically flat spacetime. Because they incorporate the structure of both the matter and the 
gravitational fields, they depend on the metric theory in question. 

When we attempt to devise more general integral conservation laws, such as for total 
energy, total momentum, or total angular momentum, we run into difficulties. It is well 
known that integral conservation laws cannot be obtained directly from the equation of 
hydrodynamics 7"”.,, = 0 because of the presence of the Christoffel symbols in the 
covariant derivative. Rather, we search for a quantity 7“”, which reduces to 7” in flat 
spacetime and whose ordinary divergence in a coordinate basis vanishes, that is, 


rH =0. (4.67) 


Then, provided that 7“” is symmetric, we find that the quantities 
P= / reds, Je =2 i: xl#ePayy, (4.68) 
5) = 


are conserved, i.e., the integrals in Eq. (4.68) vanish when taken over a closed three- 
dimensional hypersurface ©. If we choose a coordinate system (t,x) in which © is 
a constant-time hypersurface that extends infinitely far in all spatial directions, then, 
provided that 7“” vanishes sufficiently rapidly with spatial distance from the matter 
(thereby excluding for the moment a flux of gravitational waves), then P¥ and J#” are 
independent of time and are given by 


ph i ced, =e / eee (4.69) 


An appropriate choice of 7“” allows one to interpret the components of P# and J#” in 
the usual way: as measured in the asymptotically flat spacetime far from the matter, P° 
is the total energy, P/ is the total momentum, J” is the total angular momentum, and J” 
determines the motion of the center of mass of the matter. If a suitable 7” exists but is not 
symmetric, then P” is conserved but J4” varies according to 


LV 
a =-2 id Tel Bx, (4.70) 
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The quantity 7“”, often called the energy-momentum pseudotensor, has been found for 
the exact versions of general relativity and scalar-tensor theories (see Chapter 5); for a 
general discussion of the existence of such objects see Lee et al. (1974). A wide variety of 
nonsymmetric stress-energy pseudotensors have been devised and discussed within general 
relativity, but only the symmetric version guarantees conservation of angular momentum. 

There is a close connection between integral conservation laws and covariant Lagrangian 
formulations of metric theories. It is known (Lee et al., 1974) that every Lagrangian-based, 
generally covariant metric theory of gravity that either (1) is purely dynamical (possesses 
no absolute variables) or (ii) contains prior geometry, with a simple constraint on the 
symmetry group of its absolute variables (a constraint satisfied by all specific metric 
theories known), possesses conservation laws of the form r“”,, = 0, where 7“” is a 
function of certain variational derivatives of the Lagrangian of the theory that reduces to 
T“” in the absence of gravity. When there are no absolute variables, the conservation laws 
are the result of invariance under coordinate transformations, and the r“” are not tensors 
(or tensor densities), hence the name pseudotensors: moreover, there may be infinitely 
many of them. When absolute variables are present, their symmetry group produces the 
conservation laws and r“” typically are true tensors (or tensor densities). Although 7"” is 
guaranteed to exist for any Lagrangian-based metric theory, there is no guarantee that it 
will be symmetric, and no general argument is known to determine the conditions under 
which it will be symmetric. 

In the post-Newtonian limit, the existence of conservation laws of the form of Eq. (4.67) 
can be translated into a condition on values of some of the PPN parameters. We will attempt 
to construct a T“” by adopting a form, valid to PN order, 


7h’ = (1 — aU) (THY + th”) (4.71) 


where a is a constant to be determined and ¢”” is a quantity (which may be interpreted under 
some circumstances as “gravitational energy-momentum ’’) that vanishes in flat spacetime, 
and that is a function of the fields U,X, ©), ®y, V;,... and their derivatives, the velocity 
w, (and may also contain the matter variables p, p, II and v). We reject PN terms in T“” of 
the form v°7#”, IIT#”, (p/p)T#”, or w*T#”, since such terms do not vanish in general in 
regions of negligible gravitational field. 

By imposing the conditions 7“” ,, = 0, and 7"”.,, = 0 we find that, to post-Newtonian 
order, ¢#” must satisfy the equation 


tH”, —aU th” =T4, Te +7 THY 4 aU, Tl”. (4.72) 


In our attempt to integrate Eq. (4.72) we will make use of Box 4.1 and the definitions 
of the post-Newtonian potentials given in Section 4.2.1, together with formulae for the 
Christoffel symbols displayed in Box 6.1. We will also repeatedly exploit the following 
identity, which is valid for any function f: 


fa) 
Anpf; = 254 Tl) + U;V’f, (4.73) 


where 
1 
Tix(f) = Uf) = 2 nV U - Vf. (4.74) 
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Two additional identities will be useful. To lowest order, it can be shown that (U7°”) , = 
(p*U) 9 + (p*w/U),;; inserting the equations V7U = —4rp* and V7V; = —4rp*v, and 
manipulating the derivatives suitably, we obtain the identity 


4n(UT”) , — S\vur + UpUx + 2UjVig) = 0. (4.75) 


a) 
aa | 
A second identity, obtained after tedious calculations, is given by 

1 oO 
poy; = ae ae [27a(@) + 3UT (UV) + 20 jx(VX- VU) 


— De eX jx — XDej + Oje(XXeem) m 


sp 10 
+ 8p X GU + 5u(Ua) | + De Di (U;Uo) 
1 1 
+ U;| p*v’ + 2p |VU)? + —V7%6] , (4.76) 
87 An 
where »z is the solution of the equation 
V7de = —Arp*Ue. (4.77) 


Then, Eq. (4.72) can be put into the form 
Ant” , =4n (9 + %;) 


et | F 
= a 56 5 +2a 26)|V UP | 


f) 
mel 24a = Dig, $20) 3 -ea Pinna] 


+ Anba— a (um™), (4.78) 


Ant!” =4n Wy. + 1/* x) 
) 


1 
= 345 (4y+2+a — 2a2+2¢))U9U,; + (4y +44 a)Vigur 


1 F 
+ sa/uVU+ av,w- Vo)t 


id {| (Co + 4€ a)U4 (as ai)? | T(U) 
+ 2D 4(b) — 2EVx(WX- VU) + (1 + a2 — G + 2€)Dje(X) 


— 2(4y +4 + 04) (Vira = £5e¥ me Vint) 
+ (4y+4+ a1) (Wore - 5510 ¥s0) 


1 : 
_ a7 +2+a,—2a.+ 261) 5.0" 
+ €[2V?UX GU) + XjnDee +XDe eine — Fx(XDeem) m] 


ac (2a3 — a1)w'Tix(Ve) _ Zagw Tix (X¢) + anw w"T (Xen) 
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ae EG Vu) — Uw: vu| +e 


fa) 
+4n(S5y +a— Ia AUP +40", (4.79) 
X 
where ~ is given by Eq. (4.54), 
eA . 
Be 5 UV Vi + agwUjU — agw,U,(w- VU), (4.80) 


and the residual term Q/ is given by 
. 1 1 1 
Of = U,| ~(a3 + G) et’ + —OV' 85 + —O|VU? + Go" 
2 87 87 


+ 3p + anptw-y]. (4.81) 


We have also taken the liberty of adding 5 times the identity (4.75) to the right-hand-side 
of the time component f°”. The reason for this will be clear shortly. 

We first notice that Eq. (4.78) is automatically integrable, but Eq. (4.79) is not, because 
of the residual term Q/. Any attempt to write Q/ purely as a combination of gradients and 
time derivatives of gravitational fields and matter variables simply generates other residual 
terms of similar forms. Thus the integrability of Eq. (4.79) for arbitrary systems requires 
that each of the terms in Q’ vanish identically, and therefore that 


063 = 0 =GO=G=0=0. (4.82) 


These constraints must be satisfied by any metric theory in order that there be conservation 
laws of the form of Eq. (4.67). If these conditions hold, then expressions for the conserved 
energy and momentum can be obtained using Eqs. (4.69) and (4.71), combined with 
expressions for 7°° and T’° from Box 4.1, and for 1°° and ¢/° read off from Eqs. (4.78) 
and (4.79), respectively. The results are (after integrations by parts): 


1 1 
P® fe (1 + v= su+m) dx, 


. 1 1 1 1 
i =f |v (1 + a aa 5U+1+p/e) = 5 (1 +o) = 5 (1 + 02)X oj 
l 
— 5 (aw — anwXa) Pr. (4.83) 


These results are independent of the parameter a. In the expression for P®, the first term 
is the total conserved rest mass of particles in the fluid. The other terms are the total 
kinetic, gravitational, and internal energies in the fluid, whose sum is conserved according 
to Newtonian theory (which can be used in any post-Newtonian terms). Thus, P® is simply 
the total mass-energy of the fluid, accurate to O(c) beyond the rest mass, and is conserved 
irrespective of the validity of the conditions in Eqs. (4.82). This is related to the automatic 
integrability of Eq. (4.78). However, if those conditions were violated, one would expect 
violations of the conservation of P® at O(7). 

An alternative derivation of the conserved momentum uses the technique of integrating 
the hydrodynamic equations of motion 7“”.,, = 0 over all space, and searching for a 
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quantity P/ whose total time derivative vanishes. This procedure is blocked by a term 
f O/@x, where Q/, is given by Eq. (4.81). This integral can be written as a total time 
derivative only if O/ can be written as a combination of time derivatives and spatial 
divergences (which lead to surface integrals at infinity that vanish). But according to the 
reasoning given earlier, this can be true only if the five parameter constraints of Eq. (4.82) 
are satisfied. Then Q/ in fact vanishes, and the conserved P/ derived by this method agrees 
with Eq. (4.83). 

We now see the physical significance of the parameters a3, ¢1, G2, G3 and C4: they 
measure the extent and manner in which a given metric theory of gravity predicts violations 
of conservation of total momentum at PN order. If all five are zero in any given theory, then 
momentum is conserved; if some are nonzero, then momentum may not be conserved. 
According to the theorem of Lee et al. (1974), every Lagrangian-based metric theory of 
gravity must have all five conservation law parameters vanish. Notice that the parameter 
a3 plays a dual role in the PPN formalism, both as a conservation-law parameter and as a 
preferred-frame parameter. 

In order to guarantee conservation of the tensor J“”, the object #“” must be symmetric. 
Eqs. (4.78) and (4.79) show that ¢” 4 ¢/° and that there are nonsymmetric terms, B’ in ¢*. 
However, in integrating Eqs. (4.78) and (4.79), we have the freedom to add to the nominal 
solutions for ¢“” any quantity S“” that satisfies the identity S‘)” = 0. We have already used 
such an identity (4.75) in f°”, but we have been utterly unable to find an identity that will 
eliminate or symmetrize the offending terms B’ in t/*. As for the ¢” and t/° components, 
we can select the parameter b = Sy + a — | so that the term U7" in ¢/ matches the term 
UT” in t/°. The result is that 


1 1 
Srl — — 5 (a1 — 2a2)U Uy — UVM gy - 5uiwjUV'U — aU ,(w- VU), 
Srl) = 2BV4 = ay UwiV?Vq — 2a.wyU g(U— w- VU). (4.84) 
Symmetry of ¢“” requires that each of the terms in Eqs. (4.84) vanishes identically, that is, 
Y= oo = 0. (4.85) 


Any other choice of 5 will lead to the same constraints. 

We apply the name Fully Conservative Theory to any theory of gravity that possesses 
a full complement of post-Newtonian conservation laws: energy, momentum, angular 
momentum, and center-of-mass motion, that is, whose PPN parameters satisfy 


=a, =03=0 =O=G=4=0. (4.86) 


A fully conservative theory cannot be a preferred frame theory to post-Newtonian order 
since @} = @2 = a3 = 0. For such theories, only three PPN parameters, y, ( and € may 
vary from theory to theory; 7“” and ¢“” in this case take the form 


re = [1+ (Sy—1)U TH’ +2’, (4.87) 
where 


. 1 
j? — 3 (47 +3)|VUP, 


Tv 
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i =79 = = [27 + 1)UjUo + 4(7 + DUR pA] . 
i = (1 — A€U)Tix(U) + Dyes) — EN ja(WX- VU) + (1 + 26) e(X) 


1 
— 8(7 T 1) (Yin Vine) = oKkV ime Vint) 


1 1 ‘ 
aye 1) (Uorno = 5 OKU.e rio) = 5 (27 +1)? 


+ €[2V7UX GU) + XjxZee + XDe,e% — Ge(XZeem)m] » (4.88) 


where 


The conserved quantities are 


1 1 
pP® ={¢ (: + id aoe It ax 


. 1 1 
P! =f 0" E (145? -u++p/0") — Xa] © xe: 


; 1 1 
yi =| p*x! (: de 5 = sue It dx — Pit, (4.90) 
By defining a center of mass X/ given by 


_ fem leo = 70+) as 


Xi= ; 4.91 
for (1+ 4 — 5U+ Il) dx GN) 
we find from Eqs. (4.90) and the constancy of J” that 
dXi— Pl 
ea (4.92) 


that is, the center of mass moves uniformly with velocity P//P°. 
Some theories of gravity may possess only energy and momentum conservation laws, 
that is, their parameters may satisfy 


a3 G © G C4 0 7 one of {a, a} x 0. (4.93) 


We call such theories Semiconservative Theories. Their conserved four-momentum P“ 
may be obtained from Eqs. (4.83), but their nonconserved J” are ambiguous. A peculiar 
feature of the semiconservative case is that in a coordinate system at rest with respect 
to the universe, w = 0, and the spatial components ¢/* are automatically symmetric, 
irrespective of the values of a; and az, (since B/ = 0 if w = 0; see Eq. (4.80)). 
Thus, spatial angular momentum J” is a conserved quantity in this frame, whereas it is 
not in a moving frame. The center-of-mass component J*’, however, is not conserved in 
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any frame, since /” 4 ¢/° for any w. This discrepancy can be understood by noting that 
the distinction between J/* and J® is not a Lorentz-invariant distinction. Because the pPN 
metric is post-Galilean invariant, the quantities P* and J” should transform as a vector 
and antisymmetric tensor respectively under post-Galilean transformations. This can be 
verified explicitly by applying the transformation Eq. (4.55) to the integrals that comprise 
P¥ and J#”, with the result, valid to post-Newtonian order 


vA 1 
Pp =P (1450) —w-P, 

/ 1 2 0 1 
P=pP- 1+5w wP + 5w(w-P), 


JE — pk — fy Ay! +4 2 (: + 3) PlyA|, 


1 1 : : 1, 

Ji = (: =") JY — why sw wil ; (4.94) 
where w is the velocity of the boost. Thus a boost from the universe rest frame where 
dJ!* /dt = 0 to a frame moving with velocity w yields 

J = 2A [1 + Ow’)] . (4.95) 


Thus, the violation of angular momentum conservation is intimately connected with 
the violation of uniform center-of-mass motion. This is our reason for stating that 
semiconservative theories of gravity possess only energy and momentum conservation 
laws. Eq. (4.95) may be verified explicitly using Eq. (4.84), and the fact that to PN order, 


jhha9 : Mex, P= 2 / (By. (4.96) 


Every Lagrangian-based theory of gravity is at least semiconservative. 
Nonconservative theories possess no conservation laws (other than the trivial one for 
P°); their parameters satisfy 


one of {a3, C1, G, G, Ga} #0. (4.97) 


Table 4.1 summarizes the significance of the various PPN parameters. 


4.5 Other Post-Newtonian Gauges 


The choice of the standard ppN gauge has no physical implications; it is merely a choice 
of coordinates. It was made mainly for historical reasons: it was the gauge used by 
Chandrasekhar (1965) in his seminal papers on post-Newtonian hydrodynamics, and it 
was these papers that started the author along the path of post-Newtonian theory. In 
general relativity, another coordinate choice, called “harmonic” coordinates has proven 
to be extremely useful in developing systematic approximation methods to solutions of 
Einstein’s equation, both for post-Newtonian theory and for gravitational radiation theory. 


The Parametrized Post-Newtonian Formalism 


Table 4.1 pen parameters and their physical significance. 


Semi- Fully- 
GR conservative conservative 

Parameter What it measures relative to GR value theories theories 

oy How much spatial curvature 1 ay ay 
produced by mass? 

B How much nonlinearity in 1 B B 
superposition of gravity? 

€ Preferred-location effects? 0 g & 

ay Preferred-frame effects? 0 ay 0 

a2 0 Q2 0 

a3 0 0 0 

a3 Is total momentum 0 0 0 

G1 conserved? 0 0 0 

@) 0 0 0 

G 0 0 0 

ei 0 0 0 


Note that a3 is listed twice to indicate that it is a measure of two separate effects. 


This choice is the basis for the development of post-Minkowskian and post-Newtonian 
theory described in detail in PW, for example. In fact, we will use harmonic coordinates in 
Chapter 5 to derive the post-Newtonian limits of general relativity and scalar-tensor theory, 
and will use them again in Chapter 11 to analyze gravitational radiation in those theories. 

Accordingly, it would be useful to express the PPN metric in a coordinate system that is a 
generalization of harmonic coordinates that might occur in a range of alternative theories. 
In general relativity, harmonic coordinates are established via the condition 


gf”, =0,~, (4.98) 
where the “gothic inverse metric” g"” is defined by 


g S22", (4.99) 


where g is the determinant of the metric. In an alternative theory of gravity, this might not 
be the most appropriate choice, but we could imagine that a generalized post-Newtonian 
“harmonic” condition, 


[(1 — av) /-gg""], =0, (4.100) 


might be appropriate, where a is a parameter to be determined within the given theory, and 
U is the Newtonian potential. In general relativity, a = 0, and the gauge is the standard 
harmonic gauge. In Chapter 5, we will see that in scalar-tensor theories, the factor 1 — aU 
is related to a function of the scalar field. In what follows we will see that only the O(e) 
term in the factor is needed for the gauge condition at the first PN order. We will confine 


103 


4.5 Other Post-Newtonian Gauges 


our attention to semi-conservative theories of gravity, since the overwhelming majority of 
currently viable theories are Lagrangian based. 

To see what the condition (4.100) implies for the PPN metric, we first make a general 
gauge transformation of the form x" = x" + €, where 


£o = AXo + A3wlX;,  & = AX;. (4.101) 
The metric in the new coordinates takes the form 
20 = Boo — 2A1X 99 + 2A2(U? + By + Gy) — 2A3w/X gy, 
Boz = Boj — (Ar + A2)X oj — A2W*X jx, 
B5k = Bik — 22K jx, (4.102) 


and —g = 1 + (6y — 4.2 — 2)U. For the “generalized gothic inverse metric” go = 
(1 — aU),/—g2", we obtain 
9° = -1- 3y—2A.-a4+1)U+ O(2), 


ax 1 1 
g7 = [2¢9 F1)+ 501] V; 5 (1 + ag + 2€ + 2A; + 2X2) Xo 
1 ' 
_ puw + (a2 _ d3)W*X jx 
g™ = [1 + (y— 22 —1 —a)U be + 2X pe, (4.103) 
where the potentials are expressed in the new coordinates. Imposing the four conditions 


a5 = 0 we obtain the conditions 


1 : 1 
0= (« Trae a2 — 2& 21) U+ (2a - 501-23) 9 VU, 
0=(y-1-—a+22)U,, (4.104) 


which fix the values of A;, A2 and \3. Substituting these values back into Eq. (4.102), we 
obtain for the PPN metric in generalized harmonic gauge (we drop the overbars): 


S00 = —l +2U+ [2am — p94 = 1 —a) U"| 


1 os 
t ¢ a a P O9--F 26) i+ Diem ? 


1 1 PF 
By = — JAIL +7) + 501 | Yj — Jat l— 7+ gar |Xoj + Oitiarm 


gin = (1 +2) dp — (a +1 —7) Xe, (4.105) 


where 


1 3 1 1 
WHarm = 5 (27 + | 2€)®, (20 2 T al 74 ‘) b, + O3 


+ (37 — 2€)®,4 4 6B +5 (a y+1-—2€) Oy, 


prF 


Harm 


=—a,wU— ayw!V; + anw! WX jx = (202 - 01) w/X oy 5 


1 ‘ 1 : 
OFftam = uw + qo Xe. (4.106) 
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We can choose a = y — 1, and put the metric in the form 


1 - 
goo = —1 + 2U + [24 — 28U"] + (1 — 5a + 02 + 26)X + Dita 


1 1 
Ly =- [20 +y)+ 501] Vi- qoixoy a Draeas 


Sie = (1 + 2yV) dx, (4.107) 


where ~ is given by Eq. (4.89). Thus in a generalized harmonic gauge, the PPN parameters 
of the semiconservative theory can equally well be read off from the metric. 
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Post-Newtonian Limits 


Despite the fact that today general relativity is widely considered to be the “standard 
model” for gravity, there has never been a period in its history in which serious alternative 
theories were not being studied (here we exclude the many “crackpot” theories, which are 
not serious and which inhabit a world, and preprint archive, of their own). When the first 
edition of this book was published in 1981, the list of alternative theories looked entirely 
different from the list that we will present in this chapter. Many of the theories discussed 
then — Whitehead’s theory, Rosen’s theory — are no longer viable or of interest, and many 
of the theories discussed here — Einstein-A2ther, TeVeS, Chern-Simons, massive gravity, 
and so on — did not exist in 1981. On the other hand, there has been one alternative theory 
that, like the zombies of the movies, keeps recovering from near death and returning for 
more action: the scalar-tensor theory. 

The history of alternative theories of gravity can be divided roughly into three periods. 
The first period, from the publication of general relativity until the development of the 
Brans-Dicke scalar-tensor theory around 1960, was one in which alternative theories were 
invented in response to perceived difficulties with general relativity in the minds of their 
inventors. These difficulties included the relative lack of empirical support, problems with 
some of the concepts or consequences of curved spacetime, or the apparent absence of 
certain features, such as Mach’s principle. In addition to the Brans-Dicke theory, examples 
include Whitehead’s 1922 theory, Birkhoff’s 1943 theory, and Belinfante and Swihart’s 
1957 theory. Many of these early alternative theories were examined in detail in the review 
by Whitrow and Morduch (1965). 

The second period, from around 1960 to the mid 1980s, might be called the “straw- 
man” period. Armed with the deep understanding of the nature of gravitational theory 
provided by the Dicke framework (see Chapter 2) together with the pPpN framework, many 
investigators invented alternative theories, not because they believed in them necessarily, 
but to illustrate the kinds of effects that could be predicted and to explore and sharpen 
their differences with general relativity. They posed questions like “what does it take to 
have a theory with a non-zero value of a; or az?,” or “what if we turned the coupling 
constant w of Brans-Dicke theory into a function w(¢)? Even the author of this book, in 
collaboration with Kenneth Nordtvedt Jr. was guilty of inventing an alternative theory in 
this manner (see Section 5.4.2). To be fair, some theories, notably Nathan Rosen’s 1973 
bimetric theory, were invented for a real purpose and to be in competition with general 
relativity. 

The third period, beginning in the middle 1980s might be termed the “beyond-Einstein” 
period. This period opened with string theory, which revealed the potentially deep con- 
nection between the microscopic world of high-energy particles and the large-scale world 
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of gravity. Although developed in part in an attempt understand the strong interactions, 
string theory had the unexpected property that it seemed to predict general relativity in the 
small string-coupling limit. Yet the limit wasn’t pure general relativity, instead it could be 
interpreted as a scalar tensor theory with w = —1; this would strongly violate experiment 
(see Chapter 7), were it not for the expectation that the scalar field would acquire a large 
mass via spontaneous symmetry breaking, and thus its effects would be exponentially 
suppressed on any macroscopic scale, restoring an effective theory equivalent to general 
relativity to high accuracy. This and other connections with particle physics then provided 
the motivation for many of the alternative theories discussed in this chapter. Another 
motivation was the exploration of phenomena, such as violations of Lorentz symmetry, that 
might be relics of quantum gravity. A third motivation was provided by the 1998 discovery 
of the accelerated expansion of the universe. While the nominal “ACDM” cosmological 
model treated this as the result either of the presence of Einstein’s old cosmological 
constant A, or of a new form of matter, dubbed “dark energy,” the alternative possibility 
was a modification of general relativity itself at cosmological scales. 

We call this the beyond-Einstein period because, for the most part, the alternatives 
studied were not meant to replace or compete with general relativity, but rather to extend it 
into regimes far beyond those where it had been well tested, such as the solar system and 
binary pulsars, but with modifications motivated by particle physics, quantum gravity or 
cosmology. 

In this chapter, we will focus in some detail on general relativity and a variety of 
modifications of it generated by introducing additional scalar fields (the Brans-Dicke 
theory being the iconic example), vector fields or tensor fields. We will also briefly review 
a variety of theories that have been motivated either by the discovery of the acceleration of 
the universe, by considerations of how quantum gravity might change general relativity in 
the low-energy limit, or by efforts to make gravity a massive field. The number and variety 
of alternative theories of gravity has become so large that it is impossible to cover them all 
here. For a comprehensive review, see Berti et al. (2015). 

We begin in Section 5.1 with a discussion of the general method of calculating post- 
Newtonian limits of metric theories of gravity. We then discuss in turn, general relativity 
(Section 5.2), scalar-tensor theories (Section 5.3), vector-tensor theories (Section 5.4), and 
tensor-vector-scalar (TeVeS) theories (Section 5.5). Sections 5.6 and 5.7 briefly cover 
quadratic gravity and Chern-Simons theories, and massive gravity theories. Section 5.8 
will conclude this chapter with brief historical remarks about the rise and fall of some 
notable alternative theories. 


5.1 Method of Calculation 


Despite the large differences in structure between different metric theories of gravity, the 
calculation of the post-Newtonian limit possesses a number of universal features that are 
worth summarizing. It is just these common features that cause the post-Newtonian limit 
to have a nearly universal form, except for the values of the PPN parameters. Thus, the 
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computation of the post-Newtonian limits of various theories tends to have a repetitive 
character, the major variable usually being the amount algebraic complexity involved. In 
order to streamline the presentation of specific theories in the following sections, and to 
establish a uniform notation, we present a “cookbook” for calculating post-Newtonian 
limits of any metric theory of gravity. 

Step 1: Identify the variables: (a) dynamical gravitational variables such as the metric 
Suv, scalar field @, vector field K” (generally assumed to be timelike), tensor field B,,,, and 
so on; (b) prior-geometrical variables such as a flat or de Sitter background metric 7,,,, a 
cosmic time function 7, and so on; and (c) matter and nongravitational field variables. 

Step 2: Set the cosmological boundary conditions. Assume a homogeneous isotropic 
cosmology, and at a chosen moment of time and in an asymptotic coordinate system define 
the values of the variables far from the post-Newtonian system. With isotropic coordinates 
in the rest frame of the universe, a convenient choice that is compatible with the symmetry 
of the situation is, for the dynamical variables, 


0 : 
Spv = go) = diag(—co, Cl, Cl, C1); 


o— oo, 
K(k, 0,.0,.0); 
Buy + BO) = diag(—bo, bi, bi, bi), (5.1) 


and for the prior-geometric variables (these values are valid everywhere since these 
variables are independent of the local system), 


Nuv = diag(—1, 1, 1, 1), 
T =T, with VT = (1, 0, 0, 0). (5.2) 


The relationships among and the evolution of these asymptotic values will be set by 
a solution of the cosmological problem. Because these asymptotic values may affect 
the values of the PPN parameters, a complete determination of the post-Newtonian limit 
may in fact require a complete cosmological solution. This can be very complicated in 
some theories. For the present, we will avoid these complications by simply assuming 
that the cosmological matching parameters are arbitrary constants (or more precisely, 
arbitrary slowly varying functions of time). Notice that if a flat background metric 7 is 
present, it is almost always most convenient to work in a coordinate system in which it 
has the Minkowski form, for in many theories the resulting field equations involve flat- 
spacetime wave equations, which are easy to solve. Then the asymptotic form of g shown 
is determined by the cosmological solution. If 7 is present it is not generally possible 
(unless in a special cosmology or at a special cosmological epoch) to make both it and g 
have the asymptotic Minkowski form simultaneously. Of course, once the post-Newtonian 
metric g has been determined, one can always choose a local quasi-Cartesian coordinate 
system (see Section 4.1.3) in which it takes the asymptotic Minkowski form. The form 
that 7 now takes is irrelevant since, unlike g, it does not couple to matter. In theories 
without 7, it is usually convenient to choose asymptotically Minkowski coordinates 
right away. 
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Step 3: Expand in a post-Newtonian series about the asymptotic values: 


0 
Suv = go) + Pv > 


= oo(14+ WV), 
Ke = (K+, 2,2, ®), 
Buy = BY) + Buy. (5.3) 


Generally, the post-Newtonian orders of these perturbations are given by 


Poo ~ O(c) + O(), poy ~ OC”), Pik ~ OLE), 
VU ~ O(c) + O(e), 
M~ Ole) +O(2), M~ Oe”), 
bo ~ O(c) + O(e), by ~ O(/), bye ~ Ole). (5.4) 


Step 4: Substitute these forms into the field equations, keeping only such terms as are 
necessary to obtain a final, consistent post-Newtonian solution for p,,,. Make use of all 
the bookkeeping tools of the post-Newtonian limit (Section 4.1), including the relation 
(9/dt)/(0/Ax) ~ O(e!/?). For the matter sources, substitute the perfect-fluid stress-energy 
tensor T“” and associated fluid variables. 

Step 5 : Solve for poo to O(c). Only the lowest post-Newtonian order equations are 
needed. Assuming that poo — 0 far from the system, one obtains the form 


Poo = 2aU, (5.5) 


where U is the Newtonian gravitational potential, defined in terms of the “conserved” 
density p* [Eq. (4.31)], and where a may be a complicated function of cosmological 
matching parameters and of other coupling constants that may appear in the theory’s field 
equations (such as a “gravitational coupling constant”). To Newtonian order, the metric 
thus has the form 


200 = —co + 2aU + O(e’), 
&0j = o(e/?) ’ 
Bk = C10jk + OL€). (5.6) 


To put the metric into standard Newtonian and post-Newtonian form in local quasi- 
Cartesian coordinates, we must make the coordinate transformation 


- — (co)'/?x° ’ x = (c1) 2x ’ (5.7) 


which results in 299 = cp | goo, 207 = (coc1)~'/*go;, and 2% = cy ‘gj, and U = (c,)~!/2U 
(recall that p*d>x is invariant). We arrive finally at 
go =—l+ 2(acs'et/?)U + Oe); 


807 = (2?) ? 
Sik = Sik + O(e) . (5.8) 
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Because we work in units in which the gravitational constant measured today far from 
gravitating matter is unity, we must set 


Grotay = acy ey!” = 1. (5.9) 


The constraint provided by this equation often simplifies other calculations, however there 
is no physical constraint implied; it is merely a definition of units. 

Step 6: Solve for pj, to O(€) and po; to O(c/?). These solutions can be obtained from the 
linearized versions of the field equations. The field equations of some theories have a gauge 
freedom, and a certain choice of gauge often simplifies solution of the equations. However, 
the gauge so chosen need not be the standard PPN gauge (Section 4.2.2), and a gauge 
(coordinate) transformation into the standard gauge (Box 4.1) or the generalized harmonic 
gauge [Eq. (4.105)] may be necessary once the complete solution has been obtained. 

Step 7: Solve for poo to O(€”). This is the messiest step, involving all the nonlinearities 
in the field equations and many of the lower-order solutions for the gravitational variables. 
For example, if pj from Step 6 has the form pj, = 2a; Udjx + arXj,, then the determinant 
of the metric has the form 


-g= ee} 2(2 ste) ul. (5.10) 


Co C1 


The energy-momentum tensor 7”” must also be expanded to post-Newtonian order. Using 
Egs. (3.77), (5.6), (5.9), and (5.10), we obtain 


2 
poo — _P i+a+ v4 (2 252) u40(2)|, 


CoC; 2c0 Co Cl 
_ Pp j 3/2 
T= [+ oe] 
Coc, 
rik — Pv + P54 pole). (5.11) 
Coc} C) 


Step 8: Convert to local quasi-Cartesian coordinates (Eq. (5.7)) and to the standard PpN 
gauge (Section 4.2.2). 

Step 9: By comparing the result for g,,,, with Eq. (4.53) or with the PPN metric in Box 4.1 
(with w = 0), read off the PPN parameter values. 

In general relativity, it is convenient to expand not g,,,, but the gothic inverse metric, 
gi’ = V—ggi”¥ = nv — h*’, This leads to what is called the Landau-Lifshitz 
formulation of general relativity, an approach that is very useful in post-Newtonian theory 
and gravitational radiation (for a complete exposition, see PW). We will use this method 
to calculate the PPN parameters for general relativity in the next section. In others, such 
as scalar-tensor theories, it is convenient first to make a conformal transformation to an 
auxiliary metric 2... = f(¢)g,. and to rewrite the field equations in term of that metric. It 
turns out that the same Landau-Lifshitz approach that works in general relativity can then 
be applied using a gothic inverse metric defined using g,,,. 
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5.2 General Relativity 


5.2.1 Statement of the theory 


We begin with the “standard model” of gravitation, general relativity. 

(a) Gravitational fields present: the metric g. 

(b) Arbitrary parameters and functions: None. We will ignore the effects of either dark 
energy or a cosmological constant, which are too small to be directly measurable in the 
solar system or binary pulsars. 

(c) Cosmological matching parameters: None. 

(d) Field Equations: The field equations are derivable from an invariant action principle 
ol = 0, where 


1 


f= 
167G 


[ ev-eetx + Inc(q4 yr) » (5.12) 


where R = g'”R,,, is the Ricci scalar constructed from the Ricci tensor [Eq. (3.42)], G 
is the gravitational coupling constant, and J/g is the universally coupled nongravitational 
action. By varying the action with respect to g,,,, we obtain the field equations 


Guy = 80GT yy , (5.13) 


where Guy = Ruy — 5 ZyuvK is the Einstein tensor. Varying /yg with respect to the matter 
variables g4 yields the matter field equations, while the general covariance of [Ng yields 
the energy-momentum conservation equation 7” .,, = 0 if the matter field equations are 
satisfied (see Eq. (3.69)). This is compatible with the gravitational field equations (5.13) 
and the Bianchi identity G“” .,, = 0 satisfied by the Einstein tensor. This compatibility 
is not unique to general relativity. It holds in any Lagrangian-based theory that has no 
“absolute elements” (Lee et al., 1974). 


5.2.2 The post-Newtonian limit and the ppn parameters 


We will derive the post-Newtonian limit using the so-called post-Minkowskian theory, 
which is based on the Landau-Lifshitz formulation of general relativity. The reader is 
referred to chapter 6 of PW for a pedagogical development of this method. Because g is 
the only gravitational field present, we can choose it to be asymptotically Minkowskian 
without affecting any other fields. Thus, we can set c} = c) = 1 

We first define the gothic inverse metric (really a metric density), 


gi” = v—ge", (5.14) 
along with the object H°“8" = g®? gt” — g®”g?", and make use of the identity 


HeHBY |, = 2(-g)G°? + 16nG(—g) 08, (5.15) 
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where ¢'’ is the Landau-Lifshitz pseudotensor, given by 


fo 1 ap a 1 a vy 
(—8)@t = 7e—Gysa sat — a aGl + 50°’ gana ah? 


167G 
= 9 aug’ a” — 9 gua’ a4” + grug’?as a5" 
1 a 4 a VT o 
+ g (28 A ght —3g By) (29.907 _ GocGue)a a | : (5.16) 
We then impose a “harmonic” coordinate condition 
oc 4=0, (5.17) 
and introduce the potentials 
ROP Sap? = GP? (5.18) 


where 7°? = diag(—1, 1, 1, 1) is the asymptotic Minkowski metric expressed in Cartesian 
coordinates, and where ®* also satisfies the harmonic condition a = 0. The Einstein 
equations can then be expressed equivalently as a wave equation for h®’, 


ne? = ~16nG(—-g) (T +P + HP) , (5.19) 


where © is the flat spacetime d’Alembertian, —(0/0t)? + V7, and where oP is a 
“harmonic” pseudotensor, given by 


4 
(-g)10 = en = nevis) (5.20) 
Because h“? appears on the right-hand side of Eq. (5.19), either embedded within the 
matter source term, or quadratically in the Landau-Lifshitz and harmonic pseudotensors, 
the wave equation (Eq. 5.19) lends itself to iteration. Substitute the zeroth-order solution 
ho  — 0 into the right-hand side, solve the wave equation for the first-order solution Ace, 
substitute that into the right-hand side, solve, and so on (see PW for detailed discussion). 

To obtain the first iteration, we have that 7° = p*, TY = p*v/, T* = p*vivk + pd, 
—g = 1, modulo corrections of order €. Substituting 4” = 0 into the right-hand side of 
the wave equation (Eq. 5.19), it is straightforward to obtain 


A” —4GU+O(e), 

hy =4Gv! + 0(e!”), 

hi* = O(2). (5.21) 
With this starting point, we can use Eqs. (5.14) and (5.18) to express the metric as an 


expansion in terms of the potentials °°, keeping only the terms needed to obtain the post- 
Newtonian solution: 


(Oy -an Ole), (5.22) 


1 
20 = —1+ 5h 
gy = AY +0(6”), (5.23) 


T2 
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gn = 6! (1 + 5H) + O(e), (5.24) 
(—g) =14+A+ O(e). (5.25) 


We now choose units in which G = 1, so that g99 = —1 + 2U + O(e’). To complete the 
post-Newtonian metric, we need to obtain the second-iterated solution only for h° and for 
the trace h“, Substituting hi” into the right-hand side of the wave equation, making use 
of Eq. (5.11) with co = cy = a = 1| and 3a; + a2 = 3, and keeping only terms that will 
contribute to the required PN order, we obtain the wave equations for the second iteration, 


I 
Dh,” = 16x Grae n| + 14/VU)’, 


hh’ = —16n (p*v" + 3p) +2|VU. (5.26) 
Noting that 
1 

\vur? = 5VU —~ VS), (5.27) 

and that the solution of the wave equation DV = —47p” is the retarded solution 

* t— = / 
y= Pp ( |x eee, (5.28) 
x—x’| 


which can be approximated within the near zone by expanding the retardation about time ¢, 
2 


*(t,x! 3) 10 
y= f ae 5 frteatey +555 f on ex) eal +. 


=U+ sit o(e*/?), (5.29) 
we obtain the solutions 
AY = 4U+7U? + 26, — 28, +46; +2¥ + O(€/”), 
Akt — U? 4+ 4@, — 2) + 126, + O(€*/”). (5.30) 


The O(e*/) terms in Eq. (5.30) turn out to be trivial gauge effects, and thus the errors are 
actually of O(e?). Substituting into Eq. (5.25) we obtain the post-Newtonian metric 


go = -14+2U+2(p-W’)+X+0(e), 
gy = —4V,+ 0(e”), 


gi = Sx (1 +2U) + O(e). (5.31) 
where 
3 
= a — 6; +O, + 36,, (5.32) 


We can put this into the standard ppN gauge by making the coordinate transformation f = 
t— 5X. By comparing the result with the PPN metric displayed in Box 4.1 it is easy to read 
off the PPN parameter values 
Ve hale C=), 
a = 0) = 03 =O =O =G=40=0. (5.33) 
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Alternatively, we can remain in harmonic coordinates and compare the metric (5.31) 
directly with the version given in Eq. (4.105), with the parameter a = 0. Again, it is 
simple to read off the same PPN parameter values. Note that general relativity is a fully 
conservative theory with no preferred-frame effects. 


5.3 Scalar-Tensor Theories 


5.3.1 Statement of the theory 


Scalar-tensor theories have proven to be the most interesting, compelling and resilient of 
alternatives to general relativity. The first and most famous such theory was formulated 
by Brans and Dicke (1961), based on previous work by Fierz and Jordan. For a time 
during the late 1970s and early 1980s, mounting experimental evidence in strong support 
of general relativity caused interest in the Brans-Dicke theory to wane, but soon the theory 
returned, albeit in a generalized form, inspired by ideas from string theory, inflation, 
extra-dimensions, and so on [early generalizations were studied by Wagoner (1970) and 
Nordtvedt (1970)]. The theories are simple, adding a single scalar field to the metric tensor. 

(a) Gravitational fields present: the metric g and a dynamical scalar field ¢. 

(b) Arbitrary parameters and functions: A coupling function w(¢) and a potential U(¢). 

(c) Cosmological matching parameters: ¢o, the asymptotic value of the scalar field. 

(d) Field Equations: The field equations are derived from the action 


v= rare / [or a Osby - u(6)| Vaad'x + Inc(qaguv) (5.34) 


It is straightforward to vary the action with respect to g,,, and @ to obtain the field 
equations, 


81G i 
Gps = Tw a ae (0 _ 58090) 
1 
a: é (b:yv — SuvOg¢) (5.35a) 
ee to 5.64 2 


where T = g"”T,,, and Dg is the scalar d’ Alembertian with respect to the metric. 
There is an alternative formulation of the theory in which one introduces an auxiliary 
metric g,,, related to the physical metric g,,, by the conformal transformation 


. ? 
Suv = ao (5.36) 


where ¢ 1s a constant to be chosen later. This recasts the action of the theory into the form 


1 > 3+2 
7 me f 5 TE bub 7 7) V-ad'x + Ine(qa.d'Buv), 6.37) 
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where R is the Ricci scalar constructed from ,,,, G = G/do and V(b) = goU(4)/¢”. 
This formulation looks like a standard general relativistic action R\/—& plus a kinetic and 
a potential term for the scalar field. The appearance of ¢ in the nongravitational action 
would appear to violate the metric hypothesis, but this is an artifact of the redefinition of 
Zyv- In terms of the original metric g,,,,, this is still a metric theory. The representation 
of the theory in Eq. (5.34) is often called the “Jordan frame,” while the representation in 
(5.37) is called the “Einstein frame” (why this convention is the opposite of what one might 
have guessed a priori is a mystery). The Einstein frame is useful for discussing general 
characteristics of such theories, for certain numerical relativity calculations, and for some 
cosmological applications, while the metric representation or Jordan frame is most useful 
for calculating observable effects. The following redefinition of the scalar field, 


Avago, 
dA = = 
AO) 2 PB + awl", (5.38) 
~ 
puts the action into the even simpler form 
a Tre / [R - 28” pape — Vy)| Vadis + nega A(y)'Buv), 6.39) 
where V(y) = V(¢). This version was popularized by Damour and Esposito-Farése 


(1992a), who also studied multiscalar tensor theories. 
5.3.2 Post-Newtonian limit and ppn parameters 


We follow the general procedure described for general relativity in Section 5.2.2, but 
working in terms of the auxiliary metric gyg instead of the physical metric gag. We 
introduce the gothic inverse metric 


9°? = /—-ee", (5.40) 


the object Ho#8" = ge? grey — go” gH, and rely on the same identity (5.15) and Landau- 
Lifshitz pseudotensor (5.16), but with everything now expressed in terms of “tilde” 
quantities. We introduce the potentials 


AP sg? = gr? (5.41) 
impose the harmonic gauge condition 
co, (5.42) 
and arrive at the wave equation 
he = —16nG(—-g)(7°* + 74° +P +79"), (5.43) 


where T°? = ($9/@)?T°, #2? and | are the Landau-Lifshitz and harmonic pseudoten- 

sors of Eq. (5.20), but defined using h%?, and 

1 3 + 2w 
& 


(819° = 


1 
—— ge pf _ gee gue »—V(b)Bap | - 5.44 
bs = TG (2 lee ae Jone (o)g | (5.44) 
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The scalar field equation (5.35b) can also be expressed in the form of a wave equation 
in flat spacetime: 


$ = —8nGr,, (5.45) 


where 


1%; = —/-8 (74 am) 


3+ 2w 8nGdo 
| 1 d 3 + 2w ~oB 7 cap 
" 16nG do Et Pe ) |e Pap — 2h" bap ft (5.46) 


where T = T#”S,,,, = (¢0/)?T. Henceforth we will ignore the scalar field potential V(¢), 
which, among other effects, can give the scalar field an effective mass, and can generate 
self-interactions or inflationary behavior. 

We now expand ¢ = ¢o(1 + WV), where @o is identified as the asymptotic value of ¢ 
far from the system, and UV ~ O(c), and anticipating that h°° ~ O(c), hY ~ O(é/?), and 
hik ~ O(e2), we can use Eqs. (5.36) and (5.40) to express the physical metric to PN order 
in the form 


1- 3 29,2. ley 1 
=-1+ A" + m0 a U-W+O0(e 5.47 
Z00 af gi y+ 5h 5h Ole" )'s (5.47) 
gy = AX +. 0(8”), (5.48) 
7 i 
Sn = OF (: + 5h = v) LONE) 5 (5.49) 
(-g) =14+h" —40+0(2). (5.50) 


Then from the first iteration of the field equations (5.43) and (5.45), we obtain 
A” —4GU+ O(e), 
hy = 4GVi + o(e/"), 


hi = o(e), (5.51) 
and 
OD us 
= —"__ GU + O(e), 5 52 
Ta ca G22) 


where wo = w(do). Then, to Newtonian order, we have 


~4+ 2wW09 
=-1+2 = —14+ 2GraayU, 5.53 
Z00 eel + 2Gtoday (5.53) 


where we define the measured present value Gioday by 


~4+ 2wo 
Groday = G : 5.54 
ee Ae Dads ae) 
We then choose units in which Gioday = 1. Note that if @9 changes as a result of 


cosmic evolution, then Gtoday may change from its present value of unity. We will discuss 
experimental bounds on any time variation of Gtoday in Chapter 8. 


116 


Metric Theories of Gravity and Their Post-Newtonian Limits 


Proceeding to the second iteration of the field equations, noting that we must expand 
w(d) = wo + wo(doW) + ..., where wi = dw/dd|o, and defining the parameters 
1 ows 


> A= ; D5 
4+ 2wo (3 + 2w)(4 + 2w0) ( ) 


¢= 


we obtain finally 


5, ll 
£00 = 1+2U4 av (1+ CALE + 5X00 | + O(e), 


*U- Ovi +0(8?), 


80 = ae 

Sik = Fell + 2(1 — 2¢)U] + O(e’), (5.56) 
where 

y= 53 4¢)®, — (1 + 2¢A)®2 + 3 + 3(1 — 2¢) Oy. (5.57) 


Since ¢/¢9 = 14+ VU = 14+2¢U+O(ce’), it is simple to show that the gauge condition (5.42) 
corresponds to our generalized harmonic gauge of Chapter 4, Eq. (4.100), with a = —2¢, 
and so comparing Eqs. (5.56) and (5.57) with (4.105) and (4.106) (with w = 0, since we 
are working in the universal rest frame) we can read off the PPN parameters: 


1+w9 
bie ee ES 
/ 
B=1+0\=14 Pog (5.58) 


(3 + 2wo)(4 + 2w)? ’ 


and € = a; = ¢; = 0. Alternatively, we can bring Eq. (5.56) to the standard PPN gauge of 
Box 4.1 by making the coordinate transformation ¢ = f+ 5X, o and x/ = x/, and comparing 
the two metrics. 


5.3.3 General remarks 


In Brans-Dicke theory, w = wgp is strictly constant; the larger the value of wgp, the 
smaller the effects of the scalar field, and in the limit wap — ov, the theory becomes 
indistinguishable from GR in all its predictions. This is largely why the fortunes of Brans- 
Dicke theory declined by the late 1970s, as experimental data pushed the allowed value of 
Wpp higher and higher (into the thousands). As long as observations continue to support 
general relativity, Brans-Dicke theory with a large enough wgp can never be ruled out; the 
principle of Occam’s razor would have to be invoked to select the simpler theory, in this 
case general relativity. 

On the other hand, generalized scalar-tensor theories possess a richer phenomenology. 
The function w(¢) could have the property that, at the present epoch, and in weak-field 
situations, the value of the scalar field ¢o forces w to be very large, in agreement with the 
same observations that support general relativity. However, for past or future values of ¢, 
or in strong-field regions such as the interiors of neutron stars, w and A could take on values 
leading to significant differences from general relativity. 
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Damour and collaborators carried out in-depth studies of the possible differences. They 
first expanded In A(w) about a cosmological background field value yo: 


InA(y) = aol - yo) + xBolp — wo)? + (5.59) 


A precisely linear function produces pure Brans-Dicke theory, with a% = 1/(2wgp + 3), 
or 1/(2 +wpp) = 204/(1 + a2). The function In A(y) acts as a potential for the scalar 
field y within matter, and, if 69 > 0, then it turns out that during cosmological evolution, 
the scalar field naturally evolves toward the minimum of the potential where ¢ ~ ¢@o, that 
is, toward dA(p)/dy ~ 0, which implies w — oo. The result is a theory close to, although 
not precisely GR (Damour and Nordtvedt, 1993a, 1993b). Estimates of the expected relic 
deviations from GR today in such theories depend on the cosmological model, but in some 
models range from 10~° to a few times 10~’ for |y — 1]. 

However, negative values of {9 correspond to a “locally unstable” scalar potential (the 
overall theory is still stable in the sense of having no tachyons or ghosts). In this case, 
objects such as neutron stars can experience a “spontaneous scalarization,” whereby the 
interior values of y can be very different from the exterior values, through nonlinear 
interactions between strong gravity and the scalar field, dramatically affecting the stars’ 
internal structure and leading to strong violations of SEP (Damour and Esposito-Farese, 
1993, 1996). There is evidence from recent numerical simulations of the occurrence of a 
dynamically induced scalarization during the inspirals of compact binary systems contain- 
ing neutron stars, which can strongly affect both the final motion and the gravitational-wave 
emission (Barausse et al., 2013; Palenzuela et al., 2014; Shibata et al., 2014). In Chapters 7 
and 12, we will discuss the bounds that solar-system and binary-pulsar measurements place 
on the parameters ap and 3. 

On the other hand, in the case 6) < 0, one must confront that fact that, with an 
unstable y potential, cosmological evolution can drive the system away from the peak 
where dA(y)/dp ~ 0, toward parameter values that can be excluded by solar system 
experiments (Anderson et al., 2016; Anderson and Yunes, 2017). 

Scalar fields coupled to gravity or matter are ubiquitous in particle-physics-inspired 
models of unification, such as string theory (Taylor and Veneziano, 1988; Maeda, 1988; 
Damour and Polyakov, 1994; Damour et al., 2002a, 2002b). In some models, the coupling 
to matter may lead to violations of EEP, which could be tested or bounded by the 
experiments described in Section 2.3. In many models the scalar field could be massive; 
if the Compton wavelength is of macroscopic scale, its effects are those of a “fifth force.” 
Only if the theory can be cast as a metric theory with a scalar field of infinite range or 
of range long compared to the scale of the system in question (solar system) can the PPN 
framework be strictly applied. Alsing et al. (2012) worked out the post-Newtonian limit 
for finite scalar field mass, showing how the PPN parameters are effectively modified by 
Yukawa-type factors e~’/>, where r is the characteristic scale of the problem, and \ x m~! 
is the Compton wavelength of the scalar field. If the mass of the scalar field is sufficiently 
large that its range is much smaller than solar-system scales, the scalar field is suppressed, 
and the theory is essentially equivalent to general relativity. 

For a review of scalar-tensor theories, see Fujii and Maeda (2007). 
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5.3.4 F(R) theories 


These are theories whose action has the form 

T= ey | R)(-8) atx + Iroldan eye). (5.60) 
where f is a function chosen so that at cosmological scales, the universe will experience 
accelerated expansion without resorting to either a cosmological constant or dark energy. 
However, it turns out that such theories are equivalent to scalar-tensor theories: replace f{R) 
by flix) —fy (x) (R—-x), where y is a dynamical scalar field. Varying the action with respect 
to x yields f..(R — x) = 0, which implies that y = R as long as fy, # 0. Then defining 
a scalar field ¢ = —f,.(x) one puts the action into the form of a scalar-tensor theory given 
by Eq. (5.34), with w(¢) = 0 and ¢?V = dy(¢) — f(v(¢@)). As we will see, this value of 
w would ordinarily strongly violate solar-system experiments, but it turns out that in many 
models, the potential V(@) has the effect of giving the scalar field a large effective mass in 
the presence of matter, via the so-called “chameleon mechanism” (Khoury and Weltman, 
2004). As a result, the scalar field is suppressed or screened at distances that extend outside 
bodies like the Sun and Earth. In this way, with only modest fine tuning, /(R) theories can 
claim to obey standard tests, while providing interesting, non general-relativistic behavior 
on cosmic scales. For detailed reviews of this class of theories, see Sotiriou and Faraoni 
(2010) and De Felice and Tsujikawa (2010), and for a review of chameleon and other 
screening mechanisms, see Burrage and Sakstein (2017). 


5.4 Vector-Tensor Theories 


These theories contain the metric g and a dynamical, timelike, four-vector field K”. In some 
models, the four-vector is unconstrained, while in others, called Einstein-4¢ther theories it 
is constrained to have unit norm. 

(a) Gravitational fields present: the metric g and a dynamical four-vector field K. 

(b) Arbitrary parameters and functions: Five arbitrary coupling constants w, c1, C2, C3, 
and c4, and a constraint parameter . 

(c) Cosmological matching parameters: In unconstrained versions, Ky the asymptotic 
value of the vector field. 

(d) Field Equations: The field equations are derived from the action 


_ 1 
~ 167G 
+ Inc(q4, Zyv)s (5.61) 


il [a + wK,K")R ER aK a + A(K,.KY + )] (—2)'" d's 


where 
Eee = cig!’ gap + 26008 + ©3635, = c4K" KY gag. (5.62) 
Note that we have not included the possible term K"K”R,,,, as it can be shown under 


integration by parts to be equivalent to a linear combination of the terms involving c) 
and c3. 


119 


5.4 Vector-Tensor Theories 


5.4.1 Constrained Theories 


In the constrained theories, A is a Lagrange multiplier, and by virtue of the constraint 
K,,K" = —1, the factor wK,,K” in front of the Ricci scalar can be absorbed into a rescaling 
of G; equivalently, in the constrained theories, we can set w = 0. In a coordinate system 
in which the metric is asymptotically Minkowskian and the vector field has only a time 
coordinate (presumably the universal rest frame), the asymptotic value of K° is unity. 


Einstein-Ather theory 


The Einstein-Ather theories were motivated in part by a desire to explore possibilities 
for violations of Lorentz invariance in gravity, in parallel with similar studies in matter 
interactions, such as the SME. The general class of theories was analyzed by Jacobson and 
collaborators (Jacobson and Mattingly, 2001; Mattingly and Jacobson, 2002; Jacobson and 
Mattingly, 2004; Eling and Jacobson, 2004; Foster and Jacobson, 2006), motivated in part 
by Kostelecky and Samuel (1989). Analyzing the post-Newtonian limit,! they were able to 
infer values of the PPN parameters yy and £ as follows (Foster and Jacobson, 2006): 


y=1, B=1, 

8(c} + c1¢4) 

2c) — ch +.c3 
an (2c13 — C14) (C13 + C14 + 3c2) 
2 €123(2 — c4) 
with € = a3 = = @ = & = G = O, where cj23 = cy +2 +63, C13 = C1 +63, 
C14 = C; + ca, subject to the constraints c123 4 0, c14 A 2, 2c) — ct + e # 0. The present 
value of the gravitational constant is given by 


ag = 


(5.63) 


a = 


Groday = G (1 = a =, (5.64) 


By requiring that gravitational wave modes have real (as opposed to imaginary) 
frequencies, one can impose the bounds c)/ci4 > 0 and ci23/c14 > 0. Considerations 
of positivity of energy impose the constraints c) > 0, cj4 > 0 and cj23 > 0. Given these 
constraints, the conditions 

2 


C4 = =. and 
Cl 
: C13 
either c;3 = 0,or c. = a (c3 — 2c1) , (5.65) 
1 


are necessary and sufficient to guarantee that aj = a) = 0, so that the PPN parameters 
can be made to agree completely with those of general relativity. In many applications of 
Einstein-/ther theory, these constraints are imposed a priori, and one defines the two free 
parameters 


Ce =c, £063, (5.66) 


' Note that the minus sign in front the cy term in Eq. (5.62) compared to that in the references is a result of our 
convention for the signature of the metric 
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so that 
— Gy (eet Je) 
2 "3c, #e_) ’ 

(eee 

(CF io 
A(cy + c_) 
2c4¢_ 

= 5.67 
as ea (5.67) 


Another constraint on the parameters of Einstein-Ather theory comes from the requirement 
that the speeds of propagation of the various gravitational-wave modes (see Chapter 11) 
exceed the speed of light. Otherwise, a particle moving at nearly the speed of light could 
emit gravitational Cerenkov radiation and thereby lose energy (Caves, 1980; Elliott et al., 
2005). The observation of ultrahigh energy cosmic rays places such a strong constraint 
on this effect that it essentially forces all such speeds to exceed that of light, leading (in 
the limit of small values of a; and az) to the constraints 0 < c,; < landO <c_< 


c+/3(1 — cy). 


Khronometric theory 


This is the low-energy limit of “Horava gravity,” a proposal for a gravity theory that 
is power-counting renormalizable (Horava, 2009). The vector field is required to be 
hypersurface orthogonal, which implies that K° « 7°*, where 7 is a scalar field related to a 
preferred time direction. Equivalently the twist w°? = KI] + K!-.4) must vanish, where 
A° = K"K®.,, so that higher spatial derivative terms could be introduced to effectuate 
renormalizability. A “healthy” version of the theory (Blas et al., 2010, 2011) can be shown 
to correspond to the values cy = —e, cp = Ax, c3 = Bx + €, and c4 = ax + «€, where the 
limit € + oo is to be taken. The idea is to extract € times wa gw? from the Einstein-Acther 
action and let € — oo to enforce the twist-free condition (Jacobson, 2014). In this case 
y = 8 = 1, a, and az are given by 


pee 4(ax — 28x) 
Be-1 
— l : (ax — 26x) (ax + Be + 3x) (5.68) 


(2 — ax)(Bx + Ax) . 


and the remaining PPN parameters vanish. 
5.4.2 Unconstrained Theories 


In the unconstrained theories, 4 = 0 and w is arbitrary. Unconstrained theories were 
studied during the 1970s as “straw-man” alternatives to GR. In addition to having up to 
four arbitrary parameters, they also left the magnitude of the vector field arbitrary, since it 
satisfies a linear homogenous vacuum field equation of the form LAY = 0 (c4 = 0 in all 
such cases studied). Each unconstrained theory studied corresponds to a special case of the 
action (5.61), all with A = 0. 


121 


5.5 Tensor-Vector-Scalar (TeVeS) Theories 


General vector-tensor theory 


The gravitational Lagrangian for this class of theories had the form R + wK,,K"R + 
NKYKYR yy — €Pyp PY + TKy KY", where Fi, = Ky, — Ky, corresponding to the 
values c} = 2€ —7, Cc) = —7, ¢) + +c¢3 = —T, cq = 0. In these theories y, 6, 
a, and a, are complicated functions of the parameters and of the asymptotic value of 
K? = —K#K, y» While the rest vanish. This class of theories was first analyzed in the first 
edition of this book. 


Will-Nordtvedt theory 


This is the special case c} = —1, cp = c3 = cq = O. In this theory, the PPN parameters 
are given by y = 8 = 1, ay = K*/(1 + K/2), and zero for the rest (Will and Nordtvedt, 
1972). 


Hellings-Nordtvedt theory 


This is the special case c) = 2, cy = 2w, cy +O +c3 = 0 = cy. Here y, 6, a, and az 
are complicated functions of the parameters and of K’, while the rest vanish (Hellings and 
Nordtvedt, 1973). 


5.5 Tensor-Vector-Scalar (TeVeS) Theories 
| 


This class of theories was invented to provide a fully relativistic theory of gravity that 
could mimic the behavior of so-called Modified Newtonian Dynamics (MOND). MOND 
is a phenomenological mechanism (Milgrom, 1983) whereby Newton’s equation of motion 
a = Gm/r’ holds as long as a is large compared to some fundamental scale ao, but in a 
regime where a < ao, the equation of motion takes the form a? /ay9 = Gm/r*. With such a 
behavior, the rotational velocity of a particle far from a central mass where a < dg would 
have the form v ~ \/ar ~ (Gmao)'/4, thus reproducing the flat rotation curves observed 
for spiral galaxies, without invoking a distribution of dark matter. 

Devising a relativistic theory that would embody the MOND phenomenology turned out 
to be no simple matter, and the final result, TeVeS was rather complicated (Bekenstein, 
2004). Furthermore, it was shown to have unexpected singular behavior that was most 
simply cured by incorporating features of the Einstein-A:ther theory (Skordis, 2008). 

(a) Gravitational fields present: the metric g, a dynamical four-vector field K° and a 
dynamical scalar field ¢. 

(b) Arbitrary parameters and functions: four arbitrary coupling constants c1, C2, c3, and 
c4, a constraint parameter . for the vector field, and a coupling constant k, a length scale @ 
and an interpolating function ¥ for the scalar field. 

(c) Cosmological matching parameters: ¢o, the asymptotic value of the scalar field. 
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(d) Field equations: the field equations are derived from the action 


= a Le ou HM _¢)!/2 
aa | le BMY KK + (KK +1)]( g)/? atx 


-spng | Fl KOH” b.6v)(-8)'? dx + Ine (44, Bur); (5.69) 


where g,,, is an auxiliary metric related to the physical metric g,,,, by 
Sy =e °**S,,, — 2K,K, sinh(2¢), (5.70) 

R is the Ricci scalar constructed from the auxiliary metric, and 

Ee = c18" Bap + c26508 + 030504 — c4K"K’ Ba, 

hhY = gh” _ KEKY | (5.71) 
In the gravitational part of the action, indices are raised and lowered using the auxiliary 
metric. The interpolating function F(y) is chosen so that (vy) = dF /dy is unity in the 
high-acceleration, or normal Newtonian and post-Newtonian regimes, and nearly zero in 
the MOND regime. 

The PPN parameters have been computed in the limit where the function F(y) is a 
linear function of its argument y = k/?h“’¢,d,. In this limit, the scale ¢ drops out 
of the problem, and the PPN parameters (Sagi, 2009) have the values y = 2 = | and 
€ = a3 = G = 0, while the parameters a, and az are given by 
Kc, (2 — cya) — c3 sinh 4 + a —c1) sinh? 260 

2c, — cr + e 


a2 = (a2) RB —2G (41 = 2A> sinh 49 — A; sinh? 240) 5 (5.72) 


ay = (a1) R 16G 


where (a1) and (a2) m are given by their Einstein-£ther values in Eq. (5.63), and 


(2c13 — c14)? _ 4ei(2—e14) — 6(1 +13 — e14) 


A= 5.73 
€123(2— 4)  2e) — ea + a 2—c4 ( ) 
Ioy=egt- . a0— 2(1 — 2 3 
= oe = ay 2 sy ( + ) : (5.74) 
€173(2—cy4)? 2c) —c? +2 2-—cy \en3 2-cCy4 
= (213 = Gia) 4c3 Ai 2 (e = C13) 2¢13 — <i) (5.75) 
~ €193(2— 4)? 2ey —ct +H (2 — ey) C173 f25e 7 
where kK = k/87, 
1 6(2 — 
Grotey = 26 ( ss “)) =f; (5.76) 
2—c14 


and po is the asymptotic value of the scalar field. In the limit « + 0 and ¢9 > 0, ay, a2 
and G today reduce to their Einstein-Aither forms. 

When one takes into account the fact that the function ju(v) = d¥ /dy must interpolate 
between unity and zero to reach the MOND regime, it has been found that the dynamics 
of local systems is more strongly affected by the fields of surrounding matter than was 
anticipated. This “external field effect” (EFE) (Milgrom, 2009; Blanchet and Novak, 
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2011a, 2011b) produces a quadrupolar contribution to the local Newtonian gravitational 
potential that depends on the external distribution of matter and on the shape of the 
function (vy). For the solar system, this effect can be significantly larger than the 
galactic tidal contribution. Although the calculations of EFE have been carried out 
using phenomenological MOND equations rather than with TeVeS itself (for which the 
calculations would be very difficult), it is expected to be a generic phenomenon, applicable 
to TeVeS as well. Analysis of the orbit of Saturn using Cassini data has placed interesting 
constraints on the MOND interpolating function ju(y) (Hees et al., 2014). 

For thorough reviews of MOND and TeVeS, and their confrontation with the dark-matter 
paradigm, see Skordis (2009) and Famaey and McGaugh (2012). 

An alternative tensor-vector-scalar theory, containing a vector field and three scalar 
fields, in addition to the metric, was devised by Moffat (2006). 


5.6 Quadratic Gravity and Chern-Simons Theories 
 —SSCSCSC‘ésS 


Quadratic gravity is a recent incarnation of an old idea of adding to the action of GR terms 
quadratic in the Riemann and Ricci tensors or the Ricci scalar, as “effective field theory” 
models for more fundamental string or quantum gravity theories. The general action for 
such theories can be written as 


1= [ [rR aah (OR? + cfi(O)RaoR"? + axh(@)RoprsR™? 


+ aufa($) “RR — (203.00. + 2v(6)) (—g)'Pd'x 


+ Inc(q4,Zyv) > (5.77) 


where k = (167G)~!, ¢ is a scalar field, a; are dimensionless coupling constants (if the 
functions f;(¢) are dimensionless), 3 is a constant whose dimension depends on that of ¢, 
and *RR = 5€VPO Rg GRP ays. 

One challenge inherent in these theories is to find an argument or a mechanism that 
evades making the natural choice for each of the @ parameters to be of order unity. 
Such a choice makes the effects of the additional terms essentially unobservable in most 
laboratory or astrophysical situations because of the enormous scale of & x 1/l anc 
the leading term. This class of theories is too vast and diffuse to cover in this book, and no 
comprehensive review is available, to our knowledge. 

Chern-Simons gravity is the special case of this class of theories in which only the 
parity-violating term *RR is present (a; = a2 = a3 = 0) (Jackiw and Pi, 2003). It can 
arise in various anomaly cancellation schemes in the standard model of particle physics, 
in cancelling the Green-Schwarz anomaly in string theory, or in effective field theories of 
inflation (Weinberg, 2008). It can also arise in loop quantum gravity (Taveras and Yunes, 
2008; Mercuri and Taveras, 2009). The action in this case is given by 


; 1 
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[= / ja + GOR - (eta,.00.0 + 27(6)) (-g)!?d*x+Ine(q4,8uv), (5.78) 


where a and £ are coupling constants with dimensions (4, and (?4—7, assuming that the 
scalar field has dimensions ¢~“. 

There are two different versions of Chern-Simons theory, a non-dynamical version in 
which 3 = 0, so that ¢, given a priori as some specified function of spacetime, plays the 
role of a Lagrange multiplier enforcing the constraint *RR = 0, and a dynamical version, 
in which 6 £ 0. 

The PPN parameters for a nondynamical version of the theory with a = « and 6 = 0 are 
identical to those of GR; however, there is an additional, parity-even potential in the go; 
component of the metric that does not appear in the standard ppN framework, given by 

60: = oe (Vx V),. (5.79) 
Unfortunately, the nondynamical version has been shown to be unstable (Dyda et al., 2012), 
while the dynamical version is sufficiently complex that its observable consequences have 
been analyzed for only special situations (Ali-Haimoud and Chen, 2011; Yagi et al., 2013). 
Alexander and Yunes (2009) give a thorough review of Chern-Simons gravity. 

Einstein-Dilaton-Gauss-Bonnet gravity is another special case, in which the Chern- 
Simons term is neglected (a4 = 0), and the three other curvature-squared terms collapse 
to the Gauss-Bonnet invariant, R? — 4RagR%? + RagysR””, ie. fi(¢) = fo(¢) = A(¢) 
and a; = —a2/4 = a; [see Moura and Schiappa (2007) and Pani and Cardoso (2009)]. 

Berry and Gair (2011) studied f(R) theories in the small R limit, using the expansion 
f(R) = R +a )R*/2 +..., making this another special case of quadratic gravity; they used 
a variety of observations to place limits on the parameter ap. 


5.7 Massive Gravity 
>E—>>EE—E————o EEE Lt 


Massive gravity theories attempt to give the field carrying the gravitational interaction 
a mass. In a quantum context, such a field is often called the “graviton,” but the realm 
being discussed here is entirely classical; nevertheless, the term graviton is a convenient 
shorthand. The simplest attempt to implement this in a ghost-free manner suffers from the 
so-called van Dam-Veltman-Zakharov (vDVZ) discontinuity (van Dam and Veltman, 1970; 
Zakharov, 1970). Because of the three additional helicity states available to the massive 
spin-2 graviton, the limit of small graviton mass does not coincide with pure GR, and 
the predicted perihelion advance, for example, violates experiment. A model theory by 
Visser (1998) attempts to circumvent the VDVZ problem by introducing a nondynamical 
flat-background metric. This theory is truly continuous with GR in the limit of vanishing 
graviton mass; on the other hand, its observational implications have been only partially 
explored. Braneworld scenarios predict a tower or a continuum of massive gravitons, and 
may avoid the vDVZ discontinuity, although the full details are still a work in progress 
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(Deffayet et al., 2002; Creminelli et al., 2005). Attempts to avert the vDVZ problem 
involve treating nonlinear aspects of the theory at the fundamental level (Vainshtein, 1972); 
many models incorporate a second tensor field in addition to the metric. For recent reviews, 
see Hinterbichler (2012), de Rham (2014), and a focus issue in Classical and Quantum 
Gravity (Vol. 30, No. 18). 


5.8 The Rise and Fall of Alternative Theories of Gravity 


In this section, we will take a selection of theories from the first two periods of alternative 
theories and answer the question “Whatever happened to ...?”. 


5.8.1 Whitehead’s theory 


In 1922, the great mathematician and philosopher Alfred North Whitehead found himself 
uncomfortable with the fact that, in general relativity, the causal relationships among events 
in spacetime are not known a priori, but are only revealed after the field equations have 
been solved. In response, he proposed a Lorentz-invariant theory of gravity in which the 
physical metric g was constructed algebraically from a flat background Minkowski metric 
7 and matter variables, according to 
Suvks®) =v —2 [TE V=Epu dal”, 
s- (w7) 
(y4)- Sx" — (2), mun") O")> =0, 
Ww =qw)") ww). Wadd, 
do” = nudx"dx” , (5.80) 


where the superscript ~ indicates quantities to be evaluated along the past flat spacetime 
null cone of the field point x%. Eq. (5.80) are a modern version of the original theory. 
Whitehead required that clocks respond only to 7, thereby satisfying his concerns about 
causal relationships, but this meant that there would be no gravitational redshift effect, 
in disagreement with general relativity. But in 1922 this was not a problem empirically 
because attempts to measure the shift of spectral lines from the Sun had thus far failed to 
detect the effect predicted by general relativity. The full metric in Whitehead’s theory was 
assumed to govern the motion of light and matter via the geodesic equation. Furthermore, 
the theory had the property that, for a point mass at rest, the metric was equivalent via a 
coordinate transformation to the Schwarzschild metric. And once the gravitational redshift 
had been measured, it was natural to reinterpret Whitehead’s theory as a two-tensor theory 
with a flat background metric 7, and with the physical metric g governing all physics, 
including clocks. As a consequence, Whitehead’s theory could not be experimentally 
distinguished from general relativity using the gravitational redshift, light deflection, 
and perihelion advance, the main effects that supported Einstein’s theory. This led to a 
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conundrum discussed extensively by philosophers of science at the time of how to select 
among competing theories that equally satisfy experimental observations. 

But Will (1971b) pointed out that, when the theory was extended in a natural way to 
more than one gravitating body or to extended bodies, then the gravitational attraction 
between any pair of masses in the presence of a third body would be anisotropic, that is, 
dependent upon the orientation of the pair relative to the distant body (an effect additional 
to the normal tidal gravitational effects). This effective “anisotropy in Newton’s constant 
G” (see Section 6.6) would result in anomalous tide-like distortions of the Earth in the 
presence of the mass of the galaxy, that were ruled out by precise measurements made with 
gravimeters. This however did not totally end the fascination with Whitehead’s theory, 
especially among philosophers, and so Gibbons and Will (2008) embarked on a “serial 
killing” of Whitehead’s theory, pointing out that it actually fails the test of experiment in 
five different ways: 


Anisotropy in G. A reanalysis of Will’s 1971 result verified that the theory violates 
gravimeter data on anomalous Earth tides by a factor of at least 100 (see Section 8.4). 


Nordtvedt effect. The theory predicts a violation of the equivalence principle for 
gravitating bodies, leading to a Nordtvedt effect in lunar laser ranging 400 times larger 
than the data will permit (see Section 8.1). 

Birkhoff’s theorem and LAGEOS data. It was already known in the 1950s that the 
theory predicts that the metric of a static, spherically symmetric finite-sized body has 
an additional size-dependent contribution. This contributes an additional advance of the 
perigee of the LAGEOS II satellite, in disagreement with observations by a factor of 10 
(see Section 9.1.3). 

Momentum non-conservation. The theory predicts an acceleration of the center of mass 
of a binary system, an effect ruled out by binary pulsar data by a factor of a million (see 
Section 9.3). 

Gravitational-radiation damping. The theory predicts anti-damping in binary orbits due 
to gravitational radiation, in violation of binary pulsar data by four orders of magnitude 
(see Section 12.1.2). 


The purpose of this list of failures is not to gang up on Whitehead, but rather to illustrate 
that matching the Schwarzschild geometry is no longer sufficient to match experimental 
tests, and that the current generation of empirical data strongly and deeply constrains the 
theoretical possibilities. 

Whitehead’s theory is an example of a class of theories known as “quasilinear.” 
Quasilinear theories of gravity are theories whose post-Newtonian metric for a system 
of multiple bodies, in a particular post-Newtonian gauge, contains only potentials linear in 
the masses of the source bodies. In particular, in such a gauge, goo lacks the potentials U? 
and ®yy. This is a property of many theories that attempt to describe gravity by means of a 
linear field theory on a flat spacetime background. If the gauge in which this occurs is not 
the standard PPN gauge, then the gauge transformation (4.47) will yield 


Boo = Zoo + 2A2(U? + Sy + Gy) — 2A Xo. (5.81) 
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Since goo did not initially contain either U? or ®y, we can conclude immediately that £ = 3 
in this class of theories, in severe violation of experiment. 

Another group of theories in this class is known as Linear Fixed-Gauge (LFG) theories. 
The standard field theoretic approach to the construction of a tensor gravitation theory on 
a flat spacetime background is to use the gauge-invariant action for a spin-two tensor field 
Pv, combined with the universally coupled nongravitational action to yield 


1 Vv Vv Vv r Vv 
l= ml [2" Dyn” — 2ph P| +p) pl" =p? Mpa (—n)\/?d4*x 
167G 
+ Inc(d4s8uv) » (5.82) 


where gy. = Nv +Ppv- and “|” denotes a covariant derivative with respect to a Riemann- 
flat background metric 7. However, the action is singular: the gravitational part is invariant 
under the gauge transformation p,,, — Pyv — € (|v) While Jxg is not. The Bianchi identity 
associated with the gauge invariance of the gravitational action 7“”|,, = 0 is incompatible 
with that associated with the general covariance of Jyg, namely, 7”. = 0. (This 
incompatibility is the same one that many standard textbooks emphasize when discussing 
linearized general relativity.) LFG theories seek to remedy this by breaking the gauge 
invariance of the gravitational action through the introduction of auxiliary gravitational 
fields that couple to p,,, in such a way as to fix the gauge of p,,,. Nevertheless, these 
theories, devised by Deser and Laurent (1968) and Bollini et al. (1970), turn out to be 
quasilinear in the sense defined above, and predict € = (, in violation of experiment (Will, 
1973). 


5.8.2 Rosen’s bimetric theory 


Nathan Rosen was an assistant of Einstein during the late 1930s and collaborated with him 
on several seminal papers (the Einstein-Podolsky-Rosen paper, the Einstein-Rosen bridge, 
and a notorious paper on gravitational waves), but in the 1960s he convinced himself that 
black holes should not exist. Accordingly, he developed an alternative theory of gravity 
that would permit compact objects but without the event horizon. Rosen’s bimetric theory 
(Rosen, 1973, 1974, 1977) was the result. It is a metric theory based on the action 


1 Via 6 I 1/2 
I= ae | & Bay (sores aa ase) (—n) / d'x 


+ Inc(q4; pv) » (5.83) 


where 77 is a flat background metric, and “|” denotes covariant derivative with respect to 7. 
The PPN parameters turn out to be identical to those of general relativity Lee et al. (1976), 
except for a2, which is given by 


“” 


a 2 (5.84) 
c 


where cy and c; are cosmological matching parameters defined by 


co + 3c) = nv g) ; e- + ace = nuvg OR” . (5.85) 
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These parameters express the mismatch between the flat metric 7,,,, and the asymptotic 
form of the physical metric gf) determined by the cosmological model. Depending on the 
cosmological model, it was generally possible to choose co and c; so as to make a2 small 
enough to conform to solar system bounds (see Section 8.4). 

Death to Rosen’s bimetric theory came with the Hulse-Taylor binary pulsar. Despite 
the ability to agree with general relativity at post-Newtonian order, the theory predicted 
violations of the Strong Equivalence Principle in the presence of strongly gravitating 
neutron stars (Will and Eardley, 1977), and also predicted significant differences from 
general relativity for gravitational radiation. In particular, for the binary pulsar, it predicted 
dipole gravitational radiation with a negative energy flux (Will, 1977; Will and Eardley, 
1977), leading to a predicted increase in the binary pulsar’s orbital period, in strong 
contradiction with the observed decrease. 


5.8.3 Other theories 


In the years prior to general relativity, Nordstrém (1913), toyed with a conformally flat 
theory of gravity, in which the metric has the form g = /(¢)n, where 77 is the Minkowski 
metric, and ¢ is a scalar field. Irrespective of the field equations for ¢, the conformal 
flatness and a proper Newtonian limit require that g;, = —djgo0 = 6,;(1 — 2U). Therefore 
7 = —1, in strong disagreement with observation. This result can also be deduced from 
the conformal invariance of Maxwell’s equations (i.e., invariance under the transformation 
Suv — 8p), Which implies that propagation of light rays in the metric /(@) 1 is identical 
to propagation in the flat spacetime metric 7, namely, straight-line propagation at constant 
speed. 

Another class of alternative theories asserted that, in a certain coordinate system, the 
metric should always have the form ds? = —A()df +B(#) (dx? + dy’ + dz’), independent 
of the nature of the source, with ¢ determined by some field equations. Such theories 
can actually be made generally covariant by introducing a nondynamical background 
Minkowski metric 7, and a nondynamical scalar field 7 with the properties 7),,, = 0 and 
T,T yn” = —1. This scalar field acts like a cosmic time coordinate, selecting preferred 
spatial sections or “strata” that are orthogonal to V7. With these definitions, the metric in 
these “stratified theories with time-orthogonal space slices” takes the covariant form g = 
(B—A)dTSdT+Bn. Since the metric is diagonal in a coordinate system in which T,, = 5), 
(the preferred cosmic rest frame), independently of the nature of the source, then in the 
post-Newtonian limit and in the standard ppN gauge, we must have that go; = —eX 9; for 
some €. From Box 4.1, we then conclude that a; = —4(7+1) in gross violation of observa- 
tion. Some “straw-man” generalizations of this class of theories circumvented this problem 
by introducing auxiliary dynamical vector and tensor fields, in addition to the scalar 
field (Lee et al., 1974; Ni, 1973). It turned out however, that when the parameters were 
adjusted to conform to solar-system experiments, most of these theories predicted negative 
gravitational energy flux, in strong disagreement with binary pulsar data (Will, 1977). 


Equations of Motion in the ppn Formalism 


One of the consequences of the fundamental postulates of metric theories of gravity is that 
matter and nongravitational fields couple only to the metric, in a manner dictated by EEP. 
The resulting equations of motion include 


Te = [hydrodynamics and nongravitational fields] , 
wu’. = [neutral test body: geodesics] , 

BYY = Ant [Maxwell’s equations], 
Mi, = [light rays: geodesics] , (6.1) 


(see Section 3.2 for discussion). In Chapter 4, we developed the general spacetime metric 
through post-Newtonian order as a functional of matter variables and as a function of ten 
PPN parameters. If this metric is substituted into these equations of motion, we obtain 
coupled sets of equations of motion for matter and nongravitational field variables in 
terms of other matter and nongravitational field variables. For specific problems, these 
equations can be solved using standard techniques to obtain predictions for the behavior 
of matter in terms of the PPN parameters. These predictions can then be compared with 
experiment. It is the purpose of this chapter to cast the above equations of motion into 
a form that can be applied to specific situations and experiments. That application will 
be made in Chapters 7, 8 and 9. In Section 6.1, we carry out this procedure for light rays. 
Section 6.2 lays out the equations of hydrodynamics in the ppN framework, and Section 6.3 
deals with massive, self-gravitating bodies and presents appropriate N-body equations of 
motion. Section 6.5 specializes to semiconservative theories of gravity and presents an 
N-body Lagrangian and Hamilton from which the semiconservative N-body equations of 
motion can be derived. In Section 6.6, we derive the relative acceleration between two 
bodies, including the effects of nearby gravitating bodies and of motion with respect to the 
universe rest frame, and put it into a form from which one can identify a “locally measured” 
Newtonian gravitational constant. In Section 6.7, we derive post-Newtonian equations of 
motion for spinning bodies. 


6.1 Equations of Motion for Photons 
ny 


We begin with the geodesic equation obtained from Maxwell’s equations in the 
geometrical-optics limit, Eq. (6.1): 
WRK. =0, (6.2) 
29 
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where k* is the wave vector tangent to the “photon” trajectory, with 
K*k, =0. (6.3) 


Substituting k" = dx /do where a is an “affine” parameter measured along the trajectory, 
we obtain 
d*xt! Le BY dx* 7 
doz “da do 
As it stands, this equation is not particularly useful, since we have no experimental access 
to the parameter o. On the other hand, we do have access to coordinate time ¢. For example, 
the arrival of photons at Earth can be recorded in terms of atomic time fgtomic on Earth; this 
time can be directly related to PPN coordinate time ¢ = x° by well known transformations 
(see, e.g., PW, chapter 10). We can rewrite Eq. (6.4) using ¢ rather than o by noticing that 


(6.4) 


at aera 


do? do do O: >) 
Then the spatial components of Eq. (6.4) can be rewritten in the form 
dx! axl \ dx? dx 
T/ Yr? =0. : 
we ( BAS EO de ) oa ne) 
Eq. (6.3) can be written 
dx! dx” 
iw, ap 0. 6. 
Suv it dt eo) 


To post-Newtonian accuracy, Eqs. (6.6) and (6.7) take the form (see Box 6.1 for expres- 
sions for the Christoffel symbols I’ ,) 

dv > 

h = VU(1+ ) -2(1+ 7): VU), (6.8a) 
0=1-2U-V(1+270), (6.8b) 


where v = dx/dt, with |v| ~ 1. Eq. (6.8b) implies that v can be written as 
y=([1-(14+))U]n, (6.9) 


where n is a unit vector, satisfying n -n = 1. Substituting Eq. (6.9) back into (6.8a) yields 
an equation for the unit vector n, 


dni 

dt 
Notice that the right-hand-side of Eq. (6.10) is orthogonal to m, so that m remains a unit 
vector; only its direction changes. 

Eqs. (6.9) and (6.10), can be solved in a straightforward way. Consider a light signal 
emitted at PPN coordinate time ¢, at a point x, in an initial direction described by the unit 
vector k, where k - k = 1. The zeroth-order solution to Eq. (6.10) is n = k, so that the light 
trajectory is given to zeroth order by 


= (14+ 7)(6* — nin)U,. (6.10) 


x(t) =x. +k(t—t), (6.11) 
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where x, = x(t,). At the first PN order, we define 
n=k-+a, (6.12) 


where to post-Newtonian order, @ satisfies 


das ‘k ak 
= (14+) (6* — MAU, (6.13) 
with the initial condition a(t.) = 0, where U, is to be evaluated at x(¢), and U is given by 
Eq. (6.11), 


*/ 
ux() = f as, (6.14) 
s 
where s = x(t) — x’ and s = |s|. This gives 
da iB 
a -(1 +9 fp ‘Sex, (6.15) 
where 
b=8.—(se-k)k, Se=xXe—x’. (6.16) 


The vector b is directed from a point x’ within the source to the point of closest approach 
of the light ray relative to that point. To post-Newtonian order, the velocity v of the light 
ray is now given by 


v=([1-—(1+y7)U|k+a. (6.17) 


To solve for a and to integrate Eq. (6.17) to obtain the trajectory, we employ the useful 
identities, valid to lowest PN order: 


ds s-k 
d (s-k b 
a (=*) a (6.18b) 
d 1 
it In(s+s-k) = re (6.18c) 
We can then use the identity (6.18b) to express da/dt as 
da d ww DSK 
7 (14 ae ; ee: (6.19) 
which can be integrated to yield 
wD (Sk s.-k 
a(t) =—(1 +9 fp 3(— eer Jer. (6.20) 


Integrating Eq. (6.17) making use of the identities (6.18a) and (6.18c), we find the photon’s 
trajectory in the form 


X(t) =Xe + k(t — te) + kdxy(t) + 6x1 (t), (6.21) 
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where 


6x (t) =—(1 +7) / pe in oe Wee = 2] &x, (6.22a) 


6x, (t) =—(1+4) a‘ a(s _ *) Bx', (6.22b) 


assuming the initial conditions dx) (te) = 6x, (te) = 0. In the expression for dx), we have 
made use of the identity b? = (s. — s. - k)(se +s, - k). Equations (6.21) and (6.22) give a 
complete post-Newtonian solution for the trajectory of light around any mass distribution. 


6.2 PPN Hydrodynamics 
ee 


The PPN equations of hydrodynamics can be obtained by substituting the Christoffel 
symbols, displayed to the appropriate order in Box 6.1, into the equation of motion 
T“, = 0, using the post-Newtonian form of 7“” given in Box 4.1. Using coordinate time f 


Box 6.1 Christoffel symbols for the ppn metric 


To the post-Newtonian order sufficient for use in the equations of hydrodynamics or the geodesic equation for 
test bodies, the Christoffel symbols are given by 


0 
Do = —U, 

0 = » 
Py = Yj, 


1 1 
TY, = Vuln + = (47 +4 + cn Wop + = (1 + a2 — GF 2E)MK gi 
j 2 mm 
1 
+ 5 ow Uy) — an X on, 


| 
Dg = —Yj +28 + Uy = bj — 547 +4 + an)Min 


alll + a2 — C + 2€)X 09 4 5 (a a3)w'U; = sow wh xg 
1 ’ 1 
— sow ar 5 (m1 = 2a3)W'Vpj =F anw'X x0, 
Tie Oe 5 +4+4 ay )Vyq — som 


Phe = 5c + Gee — Seel)), (6.23) 


where 


v=sQrt tate Da Oy eo 


L (1-4 G)iBs + By +36 — 26), — 1G — 26) 8p — Eby. (628) 
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to parametrize the time evolution, and organizing terms to put the resulting equation into a 
form that matches the Euler equation (4.3) in the Newtonian limit, we obtain 


1 2) : 
Ea + (2—y)U+II4 P |p, wDo 


+ al — 2(y + BU] Uj + by — w [y+ 1)Up + 27 + Iv*Uy] 


1 1 
+547 +4401) [Ko + v*(Vin — Vi.) 4 51+ 02-6 +26)xef 


fl 
+ p soy ®i5 vor —of5)]. (6.25) 


where 7, ®** and ®'* are displayed in Box 4.1. 


6.3 Equations of Motion for Massive Bodies 
| en eee eee ee ee eee 


One method of obtaining equations of motion for massive bodies is to assume that each 
body moves on a test-body geodesic in a spacetime whose PPN metric is produced by 
the other bodies in the system as well as by the body itself (with proper care taken of 
infinite self-field terms). However, the resulting equations of motion cannot be applied 
to massive self-gravitating bodies, such as planets, stars, or the Sun (except in general 
relativity, as it turns out), because such bodies do not necessarily follow geodesics of any 
PPN metric. Instead, their motion may depend upon their internal structure (a violation of 
GWEP). This was first demonstrated by Nordtvedt (1968b). Therefore, one must treat each 
body realistically, as a finite, self-gravitating “ball” of matter and solve the hydrodynamic 
equations of motion (6.25) to obtain equations of motion for a suitably chosen center of 
mass of each body. For the purposes of solar-system and stellar-system experiments, it is 
adequate to treat the matter composing each body as perfect fluid. 

In Newtonian gravitation theory, this program is straightforward. By defining an inertial 
mass and a center of mass for body a according to 


Mg = foes, 
Xq=m;! / pxdx, (6.26) 


one can show, using the Newtonian equation of continuity (4.3), that 


dma 


dt : 
dx 
y= — =m," | prs, 
a 


dt 


_ dVg si dv 4 
aa = = Ma [°F % (6.27) 
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as long as there is no flux of matter from the body, such as a stellar wind, or mass ejection. 
Inserting the Newtonian Euler equation (4.3), we obtain the equation of motion for body a, 


a, = VS, (6.28) 
where 


] ; i 1 = 
w= [Mad (A 4 MA) waved +ovsh)], 629) 
a Ma Yab 


where mz, is the inertial mass of the bth body, pe 


moment tensor, given by 


is its symmetric, tracefree quadrupole 


r fs 1 ag 
[i = [o (x2! - sinPo") Px, X=x—Xp, (6.30) 
with analogous expressions for body a, and x,, and ray are given by 
Xab =Xa—- Xb, Vab= Xap : (6.31) 


Here we display the leading finite-size effects on the motion, proportional to the quadrupole 
moment of each body; a complete expression including all multipole moments may 
be found in PW, section 1.6.5. Because the gravitational potential is not uniform, the 
distribution of mass within a finite-size body can affect its own motion and that of other 
bodies in the system. These effects can have real-world consequences, such as tidal locking, 
orbital perturbations and the precession of the Earth’s angular momentum. In the limit in 
which the bodies are small compared to their separations, such effects decrease as (R/rap)’, 
with € > 2, where R is a characteristic size of a body. 

We now wish to generalize these equations to the post-Newtonian approximation, using 
the PPN formalism. We will ignore all finite-size effects that vanish in the limit of small 
bodies, such as the multipole moment terms we just described. Because there are many 
different “mass densities” in the post-Newtonian limit — locally measured rest-mass p, 
mass-energy density p(1 + II), conserved density p*, and so on — there is a variety of 
possible definitions for inertial mass and center of mass. The main requirements for a 
sensible definition of center of mass are that it be located somewhere inside the body 
(it should not wander too far off), that it be useful and convenient, and that it be used 
consistently in all developments. Once these requirements are satisfied, the freedom of 
choice is unlimited, and ultimately the most important requirement is one of convenience. 
Since fluid bodies in nature tend to be spherically symmetric to a high degree, except under 
extreme conditions such as tidal disruption, any reasonable definition of the center of mass 
will place it close to the geometrical center of the body. 

The choice we will make here has the advantage that the final equations of motion are 
simple (although “simple” is clearly a subjective statement). We define the inertial mass 
and the center of mass of the ath body to be 


Ma 


III 
so 
B=) 
* 
a 
— 
+ 
Nl 
<I 
N 
| 
NIlR 
[eal] 
+ 
q 
Se 
a 


1 1 = 
X=— | * (1 Lop Sout It) xx, (6.32) 
Ma Ja 2 
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where ¥ = v — v9), where, to lowest order, 


I 
Ya(0) = — / p*vdx,, (6.33) 


and where 


_ p. 
= post: (6.34) 
a \X—x 


In many post-Newtonian expressions, it will also be useful to define ¥ = x — x,(9), where, 
to lowest order, 


1 
Xa(0) = = i p*xd°x. (6.35) 


Note that, roughly speaking, mz, is the total mass-energy of the body — rest mass of particles 
plus kinetic, gravitational, and internal energies — as measured in a local, comoving, 
nearly inertial frame surrounding the body. As long as we ignore tidal forces on the ath 
body, which would be proportional to its finite size, then according to our discussion of 
conservation laws in the PPN formalism (Section 3.2.4), mg is conserved to post-Newtonian 
accuracy, that is, 

dma 

dt 

Making use of the equation of continuity for p*, and using Newtonian equations of motion 
in any post-Newtonian terms, we obtain 


i 1 oe 
ne aT flor (: 39 = 50411) y+ pp SoMa las, (6.37) 


where here and for future use we define 


=0. (6.36) 


- V-(x—-x)(x—-') 2 
12= o eae 6.38 
W.= | Saas (6.38) 
The acceleration a, is then given by 
aes 
“~~ dt 
1 1, Il. dv / = 
af? (1 5Y 504m) 2 x+ | (va: Vp)vd?x 
1 
+ [ (v0-2vp) #35 fp WP +5 Ta~ 57a +Pol (6.39) 


where 7,, 7," and P, are determined purely by the internal structure of the ath body. 
Formulae for these and other “internal” terms are given in Box 6.2. Notice that the 
acceleration of our chosen center of mass is more than just the weighted average of the 
accelerations of individual fluid elements, as it is in Newtonian theory. We now evaluate 
the first integral in Eq. (6.39) using the PPN equations of hydrodynamics (6.25). We make 
use of the Newtonian equations of motion where necessary to simplify post-Newtonian 
terms. For more details on how individual terms are evaluated, see PW, section 9.3. 
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Box 6.2 Integrals for massive bodies in the pen equations of motion 


Scalar integrals 


1 
——= fae d’x'd°x, 
2 J, |x — x’| 
r= | pdx, 
a 
f= / p*Tld’x. (6.40) 
a 


Tensor integrals 
pike / px Px, 
a 
c= | pr (Wik — XV) Bx, 
a 
, 1 . 
a [over 
2 a 
iK= / Wp ax, 
a 


= i i —_—— dy'dx, 
a 


2 |x — x’ 3 
z V(x —x')k 
i= f oe ee oe 
a He Pp jx — x’|3 
; vy. ee ee 
Ki = foo (x )( ) ( ) dbx’ ax. (6.41) 
a [x — x’) 
Vector integrals 
: v(x —x') 
tv = * / d’x'd°x, 
eae, 
We = [oo iy (x eae 
F [x — x’)? 
= - 
Tis il Fp a ae 
F [x — x!) 
Tee fot Ee ayy, 
a |x —x | 


Q! = / * ok! IT ( dbx” d3x' d3x 
TE ES [a xe xP 7 
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ads fore'o” (a 2) XO) pn angiehy 
|x! — x!’ 3lx — x//3 


; yA 
P! = ow (x ) d’x'd°x, 
a [xP 
j * Ok x =) J 
A= i p*p 1 d°y'dx. (6.42) 


Considerable simplification of the equations results if we make use of virial identities 
that connect a number of integrals in Box 6.2. These identities are given, to Newtonian 
order, by 


salt = soit at / pwr Px, 

1d Hk “jk Jk Jk 

aoe = 2Tj" + OF + 6"Pa, 

Le ik (ik) jk siky (ik) 

5 galt = tHe” — 3Ki + bP, — 209". (6.43) 


Two vectorial identities are given by 


d ae ; : 
G | Ovex = TE + Te + +P, 


d at : . ‘ ‘ : ; 
Gf o°Widx = 4-7) + Te! +378" - OY - PA, (6.44) 


a 


where we have discarded post-Newtonian corrections as well as terms that arise from 
external gravitational potentials, that can be shown to vary as O(R2/ ene where R, is the 
characteristic size of the ath body. 

We now assume that each body is in dynamical equilibrium. By this we mean that each 
body has had time, under its own internal dynamics, to relax to a steady state in which 
its internal structure does not depend on time. This means, in particular, that the structure 
integrals listed in Box 6.2 can all be taken to be time-independent. Then we can set equal to 
zero any total time derivatives of purely internal quantities, such as the left-hand-sides of 
Eqs. (6.43) and (6.44). This is a reasonable approximation for systems with well separated 
bodies, since any secular changes in the structure of the bodies that would prevent the 
vanishing of such averaged time derivatives occur over timescales much longer than an 
orbital timescale. 

The final form of the equations of motion is 


ag = (a) self + (@a)Newt + (@a)Nbody > (6.45) 


where 


: 1] 1 ae . ; . . ; 
(a2) seit = oe Ee +) + 581 (27, — 377") + QE + GEZ + 36eP:| 


a 


1 if 
— —az3(w+ Vq)* HY , 
Ma 


(6.46a) 
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. i . 
(aj) Newt = ——(mp)'Ux, (6.46b) 


Ma 


J 

‘ Mpn, 
(aboy = =D ee ye = 27 + 2) 06-98) + OV + DM 

b#a ab 


3 1 
— = (Nab . vp)? + 5 (a2 + a3) |w + |? 


2 
1 3 é 
— youl + ¥q)+(W+ vp) — 502 [Mab -(w+ yp) 
1 Ma My 
— (2742841 aia @) (2) + 28) | 
Yab Tab 
+ a S mas - [(27 + 2)¥a — (27 + Lvs] (v3 — vi) 
b#a en 
1 ; j 
=e = Nap * wt+y,)4 5 (Ya Vp) |(w + v4) 
b4a oe 


+e pm “ose 26) — 4 (28 —1—2€ oa) — 


ba c#a,b oe 

Stes ay — C1) (tga - Bye) = ee (n a) 

5 72, ab * Bbc pa, (Mac * Moc 
MpMej 


— 5 (4y +3 -2€ tay — 02 + G) oes: Wt. 


r 
b4a c#a,b Tab" he 


—§ 2 » _(n aoe Mh) (o# 7, 3n! ts) ’ (6.46c) 


b#a c#a,b tas 


where Map = Xab/Tab- 

The terms within square brackets in (dq)seir, Eq. (6.46a), depend only on the internal 
structure of the ath body, and thus represent “self-accelerations” of the body’s center 
of mass. Such self-accelerations are associated with breakdowns in conservation of total 
momentum, since they depend on the PPN conservation-law parameters a3, C1, ¢2, ¢3, and 
¢4. In any semiconservative theory of gravity 


B= =-O=-G=G=0, (6.47) 


and these self-accelerations are absent. Also note that spherically symmetric bodies, or 
bodies with reflection symmetry about the origin (as assumed in PW) suffer no acceleration 
regardless of the theory of gravity, since for them the terms t!, Ti, To’, 24, E/ and P/ are 
identically zero. The same is true for a composite massive body made up of two bodies in 
a nearly circular orbit, when the self-acceleration is averaged over an orbital period. Thus, 
there is little hope of testing the existence of these terms in the solar system because of the 
high degree of symmetry of the bodies and the high circularity of their orbits. However, in 
binary pulsars, where orbital eccentricities can be large, this effect has been tested, leading 
to a tight bound on ¢) (see Section 9.3). 

The final term in Eq. (6.46a), —m7!a3(w + va)‘HE , is a self-acceleration which 
involves the massive body’s motion relative to the universe rest frame. It depends on 
the conservation-law/preferred-frame parameter a3, which is zero in any semiconservative 
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theory of gravity. For any static body, ¥ = 0, thus H'"! is zero, but for a body that rotates 
uniformly with angular velocity w,, ¥ = wy x X, and thus 


: . x(x — x')J 
HY = cde ftp ad's 
a Ix — x | 


= (6.48) 


For a nearly spherical body, the isotropic part of 02” makes the dominant contribution to 
Eq. (6.48), that is, 


1 , oe ey 
Qin ~ Ziad", HY ~ 5 wa, (6.49) 
Then the final acceleration term in Eq. (6.46a) becomes 
1. Og 
— 503—(w+ vq) X Wa. (6.50) 
3 "Mm, 


In Section 8.4 we will see that this term can produce strikingly large accelerations in 
spinning pulsars, leading to very strong bounds on a3. 

The next term, (d,)Newt in Eq. (6.45), is the quasi-Newtonian acceleration of the body, 
but it contains post-Newtonian corrections that depend on the internal structure of both 
body a and the other bodies in the system. These corrections can be grouped together so 
as to appear as corrections to the “passive gravitational mass” of body a and to the “active 
gravitational mass” of the other bodies b. In fact, the passive mass turns out to be tensorial, 
given by 


a 


j ik Qa m om 
(mf = mo} 3 [1+ 48 pa PSE Oa Gi) sens | 


oF 
+ (26 — a2 + 01 — 2) (6.51) 
The quasi-Newtonian potential {((x,) has the form 
ye [ma (nas) |o 
U(xa) = >> — (6.52) 
b#a 
where the active gravitational mass of the bth body is given by 
1 1 Q% Ey 
[ma(itas)]o = may 1 + (48 — 7-3 -3€ - <03 — =0) -2) — + G— 
2 2 Mp Mp 
3 \P, 1 = gpl 
+ (3G — 1-503) + 5(G —26)n),ni,— F. (6.53) 
2 mp 2 Mp 


Note that the active and passive gravitational mass tensors may be functions of direction 
n,,. An alternative form of the quasi-Newtonian acceleration involves inertial, active 
and passive mass tensors that are independent of position, at the expense of having a 
complicated tensorial gravitational potential, as follows: 


(71) J (ae) Newt = (tnp)s"0"" , 


0 a mq nin, 
ue" =) (ing), (6.54) 


rT, 
ba ab 
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where the inertial, passive, and active mass tensors are given by 


Ma 


ik 
(ng =m fof fo on 0) 2] + (a2-G 4 ree |. 


Lm 
ino" = ma 5" 1 + (48 7-3 ed ee bs 


a Ma 


Sa et 1 1 OQ, 
Gin) = maf [1+ (49 ApS = BE 73 561 26q) 


mq 
+2 + (30 6 50s) mE] + 506 26) . (6.55) 
Mp 2 m 


b 2 Mp 


In Newtonian theory, the active gravitational mass, the passive gravitational mass, and 
the inertial mass are the same, hence each massive body’s acceleration is independent 
of its mass or structure, in accord with the Newtonian equivalence principle. However, 
according to Eq. (6.55), passive gravitational mass need not be equal to inertial mass in 
a given metric theory of gravity, and in fact both may be anisotropic; their difference 
depends on several pPN parameters, and on the gravitational self-energy (Q and Q/) of the 
body. This is a breakdown in the gravitational Weak Equivalence Principle (GWEP) (see 
Section 3.3), also called the “Nordtvedt effect” after its discoverer (Nordtvedt, 1968a,b). 
The possibility of such an effect was first noticed by Dicke (1964) (see also Dicke (1969) 
and Will (1971c)). The observable consequences of the Nordtvedt effect will be discussed 
in Chapter 8. Its existence does not violate EEP or the E6étv6és experiment (Chapter 2), 
because the laboratory-sized bodies considered in those situations have negligible self 
gravity, that is, (Q./m)tab bodies < 1073. 

According to Eq. (6.55), the active gravitational mass for massive bodies may also differ 
from inertial mass and from passive gravitational mass. In Newtonian gravitation theory, 
the conservation of momentum of an isolated system is a result of the law “action equals 
reaction,” that is, of the law “active gravitational mass equals passive gravitational mass.” 
In the PPN formalism, one can still use such Newtonian language to describe the quasi- 
Newtonian acceleration (a,)Newt. From Section 4.4, we know that momentum conservation 
is a property of semiconservative theories of gravity, whose parameters satisfy a3 = (; = 
0. By substituting these values into Eq. (6.55), we find that for semiconservative theories, 
the active and passive mass tensors are indeed equal, and are given by 


ik 
= i =m ff (08 o=3 36) ae (6.56) 


a Ma 
Equivalently, we can write (aq)Newt to post-Newtonian order in the form 


(thy)2*(a*\ Newt = Ymamof | 4 (48 —y—3 — 38) (2: 4: +) | me 


™m m 
ba a b ‘ab 


ate ON ne, 
a4 ab (957k — 3nf gk ) (6.57 
+é ( Ma Mp an ( "ap" ab ( ) 


The term in braces is manifestly antisymmetric under interchange of a and b, hence action 
equals reaction, and S~, (7)d*(a*)Newt = 0. 
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Note that in general relativity, the mass tensors of Eq. (6.55) are isotropic and equal to 
the inertial mass, that is, dropping the Kronecker deltas, 


(my) = (mp) = (ma) =m, [general relativity]. (6.58) 


There is no Nordtvedt effect in general relativity. However, in scalar-tensor theories, for 
example, the Nordtvedt effect is present; in those theories mj = mg, but 


ed 


a) 
Tul mi: (6.59) 


Mp => Ma = Mg 1 
For most practical situations, we may assume that the bodies in question are spherically 
symmetric, then using the approximation Qi ~ fosk Q, to simplify the mass tensors, we 


may write 


(agg, w= yma (6.60) 


(aj ) Newt — 


where we combine (m/")—! and (m") into a single quantity mp to obtain 


i 10 2 2. 1.\QQ 
(mr) =1+|48=-7—3 E-—a, + Faz G1 @) ; (6.61a) 
Ma 3 3 a0 3) ae 
m 10. 1 Q 
(ma)o _ | 4B —y-3 bee == 295 | 
Mp 3 2 Mp 
E, 4. 
a (sa -G- a) =, (6.61b) 
Mp 2 Mp 


The remaining term (a) Nbodys in Eq. (6.45) is called the N-body term. It contains the 
post-Newtonian corrections to the Newtonian equations of motion which would result 
from treating each body as a “point mass” moving along a geodesic of the PPN metric 
produced by all the other bodies, assumed to be point masses, taking account of certain 
post-Newtonian terms generated by the gravitational field of the body itself. It is the N-body 
acceleration which produces the “classical” perihelion shift of the planets, as well as a host 
of other effects, to be examined in Chapters 8 and 9. For the case of general relativity, the V- 
body terms in Eq. (6.46c) are in agreement with the equations first obtained by Lorentz and 
Droste (1917), and later by de Sitter (1916) (once a crucial error in de Sitter’s work has been 
corrected), Einstein, Infeld, and Hoffmann (1938), Levi-Civita (1965), and Fock (1964). 


6.4 Two-Body Systems 
SSS eee ee aa 
6.4.1 Effective one-body equation of motion 
We now truncate the PPN N-body equations of motion (6.45) and (6.46) to VN = 2. We 
consider a system of two bodies of inertial masses m, and m and self-gravitational energies 


Q, and Q2. Both bodies will be assumed to be reflection symmetric about their centers of 
mass, so that the self-terms in Eq. (6.46a) can be dropped, but we will let the second body 
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have a small quadrupole moment i as defined in Eq. (6.30). We assume that the entire 
system is at rest with respect to the universe rest frame (w = 0) and that there are no other 
gravitating bodies near the system. In Chapter 8 we will return to the effects of motion 
of the system and of distant bodies (preferred-frame and preferred-location effects) on the 
two-body problem. For the moment we ignore them. We work in a PPN coordinate system 
in which the center of mass of the system is at rest at the origin. Making use of the fact that 
each body is nearly spherical, Of ~ 50, we obtain from Eqs. (6.45) and (6.46) the 
acceleration of each body 


m mn m 1 
= eh Viti 4 2 [27 +26)" + (274284145 =a — a) wt 

1 1 nS ; 

+ 54 +4 + a1)(v1 + v2) — 527 +2 + a2 + a3)v2 + 5 (1 + a2)(m- v2) 
m2n 

+ = + [27 +2) — 27 + 1a] 1 
1 mon 

a [(4y+4+a1)y — (4¥ +24 a1 — 2a2)r2] 2, 

a, = {1 =2;n— —n}, (6.62) 


where x = X12, nm = x/r, and mp is given by Eq. (6.61a). Including the Newtonian 
contribution of the quadrupole moment in the quasi-Newtonian potential produced by body 
2 as shown in Eq. (6.29), we have 


n! 15 ke nike) 
(4,1 = —(ma)ay — SHO, 
n’ 15 (ma): y nee) 
(y= hat Taye a (6.63) 


where ma is given by Eq. (6.61b), and n\*") is the symmetric tracefree combination of unit 
vectors, given by 


nV) = pinky? — r (nid + nk +n 5M) (6.64) 


In Eq. (6.63), we have ignored the difference between the inertial and active masses in the 
quadrupole contributions, and used active masses throughout. Even though the quadrupole 
terms are of Newtonian order, they are generally very small, and thus this is a reasonable 
approximation. For a body which is axially symmetric about an axis with direction e, it can 
be shown that 


mn n 1 n 
Le = —(ma)2R5J2Q2) (oe Ken — qe ) . (6.65) 


where R) is the mean radius of body 2, and J(2) is its dimensionless quadrupole moment 
(€ = 2), given for general axisymmetric bodies by 
C-A 
mR?” 
where C and A are the moments of inertia about the symmetry axis and about an axis in the 
equatorial plane, respectively. 


j= (6.66) 


143 


6.4 Two-Body Systems 


Since the center of mass is at rest at the origin, we may replace v; and yp in the post- 
Newtonian terms in Eqs. (6.62) by 


y=—Vv, wW=-—y, (6.67) 
m m 
where 
V=EV-W, M=m+mM. (6.68) 


We also define the dimensionless reduced mass parameter 
mm 
= ———.... 6.69 
7 (m, + my)? ( ) 
Then the relative acceleration a = a, — a» takes the form of an effective one-body equation 
of motion, 
mn 3 mR5J22) 
a 2 


+ Say +28 +912 +01 - 20) 


[Sn(e-n)* —2(e-n)e—n] 


m 
r 


1 3 
—=2y4+ (6+ a1 +02 + a3)|v + ue. +a2)(v- n)'| 


2 
Ee vial ead py 42 FO ay ay), (6.70) 
where 
m= (mp)2 (ma), + ids 
my, 
=m-4 [self energy terms for bodies 1 and 2]. (6.71) 


The self-energy terms from Eqs. (6.61b) and (6.61a) that appear in the above expression 
are constant if the bodies are in quasistationary equilibrium. Furthermore, observations of 
a binary system can only measure the mass m”*, via Kepler’s third law, for example, and 
thus the self-energy corrections are unmeasurable. Henceforth, we will simply drop the (*) 
in Eq. (6.70). 


6.4.2 Perturbed two-body orbits and the Lagrange planetary equations 


Here we provide a brief review of standard orbital perturbation theory, used to compute 
deviations from Keplerian two-body motion induced by perturbing forces, described by 
the equation of motion 


 bbas (6.72) 


where 6a is a perturbing acceleration. For a general orbit described by x and v = dx/dt, we 
define the “osculating” Keplerian orbit using a set of orbit elements: the semilatus rectum p, 
eccentricity e, inclination v, nodal angle 2 and pericenter angle w, defined by the following 
set of equations (see Figure 6.1): 
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pericenter 


ascending node 


Elements of a general relative two-body orbit. 


x=rn, 
r=p/(1+ecosf), 
n = (cosQ cos ¢ — cosvsinQ sin d) ey 


+ (sinQ cos d + coszcos 2 sin ¢) ey 


+ sinvsin dez, 
A=An/dO, h=nxX, 
h=xxv= mph, (6.73) 


where f= ¢ — w is the true anomaly, ¢ 1s the orbital phase measured from the ascending 
node, and e, are chosen reference basis vectors. From the given definitions, we see that 
v =in+ (h/r)A and+ = (he/p) sinf. The semilatus rectum p is related to the semimajor 
axis a by p = a(1 — e”). One then introduces the radial R, cross-track S and out-of-plane 
W components of the perturbing acceleration da, defined, respectively, by R = n- 6a, 
S = X-daand W =h- 6a, and writes down the “Lagrange planetary equations” for the 
evolution of the orbit elements, 


dp = Dp S 

dt  Vm1-+ecosf’ 

de [p[. .., , 2cosft+e+tecos*f 

& = [2[sinrr t 1+ ecosf S 5 


dw 1 [p ; 2+ecosf . sin (w +f) 
—=- /—|- R4 S t : ; 
dt -/2| cost 1+ rr a ey 1+ mrad 


m 
du |p r 
= [ew (£) cos ¢, 


dQ _ Pw) (£) sing (6.74) 


dt m p/ sine 
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(see PW, section 3.3 for further discussion.) We have not listed a sixth equation for the 
variation of 7, the “time of pericenter passage”; the six elements p, e, w, 1, and 9 and T 
provide a complete characterization of the six orbital variables x(t) and v(t). The element 
T is relevant if the evolution of the orbit with time is the main focus. But for many 
applications, the evolution with orbital phase for ¢ is the focus. In the latter case we use 
Eq. (6.74) and close the system by providing equations that connect for @ with t, given by 


df _ h dw 

dP dt at’ 

db oh dO 

a COS L a (6.75) 


These equations are exact: they are simply a reformulation of Eq. (6.72) in terms of new 
variables. However, they are particularly useful when the perturbations are small, so that 
solutions can be obtained by a process of iteration. At lowest order (no perturbations), the 
elements p, e, w, 9, and are constant, and df/dt = ,/mp/ 7’; those solutions can then be 
plugged into the right-hand side and the equations integrated to find corrections, and so on. 
The corrections to the orbit elements tend to be of two classes: periodic corrections, which 
vary on an orbital timescale, and secular corrections, which vary on a longer timescale, 
depending upon the nature of the perturbations. 

In many applications, such as planetary orbits in the solar system, the orbital inclinations 
are very small, and it is difficult to separate the variation of w and of 2, instead one 
measures variations in the pericenter direction within the orbital plane, relative to the X 
reference axis (the equinox direction, in the case of the solar system). The variable that 
describes this variation is 


dw _ dw 4 
ad dtd 
_ 1 /p 2+ecosf . 
oe |- cosfR + eet inf : (6.76) 


We are mainly interested in “secular” variations of the orbital elements induced by the 
perturbations shown in Eq. (6.70). The components of the perturbations are given by 
_ 3 mR J, 


R= 5 [s(e-m — 1] + | By +28 + n(2 +01 ~ 20) 


x[3 


= 5 Br m6-+ 01 baa tan)? + 5 fy +4 (1 — 2a ~ 09) | 


2, 
S= 3 (e-n)(e-A)+ “(¥-n)(¥ -A) [2y +2 -—7(2-—a1 +02), 
2 
W = 3 (e-n)(e-h), (6.77) 


where we have dropped the subscript “2” from R and Jz. We will assume that the X¥ — Y— Z 
reference basis is chosen so that the symmetry axis of the body with the quadrupole 
moment is in the Z-direction. To calculate the changes in the orbit elements to first order 
in the perturbations, we substitute Eq. (6.73) into (6.77) and substitute R, S and W into 
Eq. (6.74). We convert the equations from d/dt to d/df using df/dt = h/r’, and integrate 
over f from 0 to 27. The resulting secular changes in the elements per orbit are given by 
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Ap =0, 
Ae=0, 
Ac =0, 
6am | 1 
Aw = (2 + 27 — 8) + = (20 — a2 + 03 + 26) 
p 3 6 
2 
+ 67J2 (=) (1 : sin?) , 
Pp 4 
R\2 
AQ = —31)p (=) COSL. (6.78) 


We will make use of the rate of pericenter advance when we turn to the orbit of Mercury 
in Chapter 7. We will employ the Lagrange planetary equations in Chapter 8 when we 
discuss orbital perturbations induced by the Nordtvedt effect and by preferred-frame and 
preferred-location effects, in Chapter 9 when we discuss orbital perturbations due to spin 
effects and in Chapters 10 and 12 when we discuss orbits involving compact bodies, such 
as binary pulsar systems and bodies orbiting supermassive black holes. 


6.5 Semiconservative Theories and N-body Lagrangians 


The overwhelming majority of modern alternative theories of gravity are based on an 
invariant action principle, and so they automatically generate a post-Newtonian limit that 
is semiconservative. It is therefore appropriate to re-express the post-Newtonian N-body 
equations of motion with the constraints a3 = ¢, = 0. Ina PPN coordinate frame at rest with 
respect to the universe rest frame, the resulting equations consist of the standard Newtonian 
acceleration (here we ignore the GWEP violating effects on the inertial, passive and active 
masses) plus the post-Newtonian N-body acceleration a Nboay, Eq. (6.46c) with w = 0 and 
with semiconservative PPN parameters, 
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Remarkably, these equations of motion can be derived from the Euler-Lagrange equations 
obtained by varying the trajectory x,(t), v,(¢) of the gth particle in the action 


INbody = [ue (6.80) 
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A Hamiltonian H can then be constructed from L in the usual way by defining pi = OL/ dv} 
and H = )° p,, - ¥a — L, with the result 
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where we have set w = 0, for simplicity. A Hamiltonian formulation of the equations of 
motion is useful for analyzing the dynamics of many-body systems. 


6.6 The Locally Measured Gravitational Constant 


Here, we derive an equation which is not really an equation of motion, but is nevertheless 
a fundamental result within the PPN formalism. In the previous section, we found 
that some metric theories of gravity could predict a violation of GWEP (Nordtvedt 
effect). Such effects represent violations of the Strong Equivalence Principle (SEP). As 
discussed in Section 3.3, the existence of preferred-frame and preferred-location effects in 
local gravitational experiments would also represent violations of SEP. One such local 
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gravitational experiment is a measurement of Newton’s constant G, usually called the 
Cavendish experiment, in honor of the experiments carried out by Henry Cavendish in the 
late 1700s. In an idealized version of this experiment one measures the relative acceleration 
of two bodies as a function of their masses and of the distance between them. Distances and 
times are measured by means of physical rods and atomic clocks at rest in the laboratory. 
The gravitational constant G is then identified as that number with dimensions cm>g~!s~? 
which appears in Newton’s law of gravitation for the two bodies. This quantity is called 
the locally measured gravitational constant GL. 

The analysis of this experiment proceeds as follows: a body of mass m, (source) falls 
freely through spacetime. A test body with negligible mass moves through spacetime, 
maintained at a constant proper distance r, from the source by a four-acceleration A 
provided by some nongravitational force. The line joining the pair of masses is nonrotating 
relative to asymptotically flat inertial space. An invariant “radial” unit four-vector E,, points 
from the test mass toward the source. Then according to Newton’s law of gravitation the 
radial component of the four-acceleration of the test mass is given by 


peeeG, |, (6.83) 
"p 


for rp small compared to the scale of inhomogeneities in the external gravitational fields. 
Since the quantity A -E, is invariant, we can calculate it in a suitably chosen PPN coordinate 
system, then use Eq. (6.83) to read off the locally measured G_. 

Before carrying out the computation, however, it is instructive to ask what might be 
expected for the form of A - E,. to post-Newtonian order. We imagine that the source and the 
test body are moving with respect to the universe with velocity w, and are in the presence 
of some external sources, idealized as point masses of mass m, at location x,. It is simplest 
to do the calculation in a PPN coordinate system in which the source is momentarily at 
rest. Then we would expect A - E, to contain post-Newtonian corrections to the equation 
A-E.= my /r, of the form 


m, Mm, m,m m,m My, 
A-E,: ; . ae (wi), (6.84) 
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where rj, = |x — Xq|. In obtaining these possible corrections, we have neglected the 


variation of the external gravitational potentials across the separation 7. Such a variation 
will produce the standard Newtonian tidal gravitational force, which is of the form 


(A-E,)tidal? = "Tp > (6.85) 
Ma 

and post-Newtonian corrections to this force. We will neglect such forces throughout. The 
first term in Eq. (6.84) represents post-Newtonian modifications in the two-body motion 
of the test body about the source; these can be understood and analyzed separately from 
a discussion of G,. The third term represents effects due to the gradients of the external 
fields; however, if we fit A-E,. to an i curve in order to determine G_,, these terms will have 
no effect (in most practical situations, they are smaller than the terms we are interested in 
by factors of r,,/r12). We will drop both types of terms throughout the analysis. Thus, we 


retain only terms of the form (mj /r>)(ma/ria) or (m/r>) wy. 
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The form of the PPN metric that we will use is given by the expression in Box 4.1, where 
now the velocity w is the source’s velocity relative to the mean rest frame of the universe, 
denoted w,. We label the test body by a = 0, the source by a = | and the remaining bodies 
by a = 2, 3,.... Initially, both the source and test body are at rest, that is, 


v(t = 0) = vo(t = 0) =0. (6.86) 


We separate the Newtonian gravitational potential U; due to the source from that due to 
the other bodies in the system: 


Ma 
U(x) = a 
(x) =Ui(n)+ >> a (6.87) 
aAl 
where r) = |x — x1|, a = |x — X,|, and Uj is assumed for simplicity to be spherically 


symmetric. 
The proper distance between the test body and the source is given by applying Eq. (3.48), 
with the result 


pein ‘ [1 + U(x) + O(e)] dix] 


=r {1+7>.™], (6.88) 


where, in line with our starting assumptions, we have neglected the variation of the external 
gravitational potential across the separation 79; and we have dropped the term arising from 
the integral of U; from xo to x; because it leads to effects only in the two-body motion 
of the test body about the source. The proper distance 7, is to be kept constant by the 
four-acceleration A, thus 


dry Pr, 
—_—=— =0. : 
dt dt Oe”) 
Recalling that v; = vp = 0 at ¢t = 0, it is simple to show that this implies that 
dvo dy, 
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where 19; = X01 /T01- 
We now assume that the source follows a geodesic of spacetime, but that the test body 
undergoes a four-acceleration A, so that 


y las = 
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with wi. ,, = 0. In PPN coordinates, with v; = vo = 0, this may expressed at t = 0 as 
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with A° = 0, where, for the test body, 


2 
(F) =1-2)°™10(2), (6.93) 
aAl 
where we again ignore spatial variations in external potentials and drop the contribution 
of the source potential. We use the PPN Christoffel symbol c from Box 6.1, and retain 
only the terms discussed earlier. Substituting Eq. (6.92) into Eq. (6.90), and including the 
Newtonian tidal force for illustration, we obtain 
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For a spherically symmetric body, the potential U; and superpotential X, are given by 
1/ 
UH=3,  X=mm+s—, (6.96) 
rol 3 ror 


where J; is the source’s spherical moment of inertia, given by 
= i praex. (6.97) 
1 


The radial unit four-vector E, is proportional to the coordinate unit vector, El = —anj,, 
with £° = 0. Imposing the normalization 8, rE; EY = 1, and including in the metric only 
external potentials, we obtain 


, 5 Ma 
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The final result for the invariant radial component of the four-acceleration A is 
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The first term in Eq. (6.99) is simply the Newtonian tidal acceleration; minimizing these 
accelerations is a significant challenge in real-life Cavendish experiments. From the second 
term, we can read off the locally measured gravitational constant, 


Gy, =1—-— 48 —y-3-G@-€(3+x)] Vex [a — a3 — ay (1— x)] wy 
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Here, we see a direct example of the possibility of violations of the Strong Equivalence 
Principle, via preferred-frame or preferred-location effects in local Cavendish experiments. 
The preferred-frame effects depend upon the velocity w, of the source relative to the 
universe rest frame, and are present unless the PPN preferred-frame parameters ay, a2, 
and a3 all vanish. The preferred-location effects depend upon the gravitational potentials 
Uext and ue of nearby bodies, and are present in general unless the PPN parameters satisfy 
46 —y-3-—G© =€ =0. We note that general relativity predicts that GL = 1. 

For an alternative derivation of G, that involves transforming the PPN metric to a locally 
comoving quasi-inertial frame of a chosen massive body and examining the effective 
“Newtonian” potential of that body in the presence of motion relative to the preferred 
frame and of external matter, see PW section 13.2.6. 


6.7 Equations of Motion for Spinning Bodies 
a ee ee 


In Section 6.3 we ignored the finite extent of each body, except insofar as it would permit 
internal kinetic, gravitational and other forms of energy. We ignored any finite-size effects 
that would lead to tidal interactions and their post-Newtonian corrections, for example. 
But this meant that we also ignored the effects of rotation or spin of the bodies. But 
rotation is everywhere, and it is important to incorporate it in a description of the motion 
of an N-body system. In gravitational physics, it is becoming increasingly clear that spin 
effects play a central role in such phenomena as binary black-hole inspirals, gravitational 
collapse, accretion onto compact objects, and the emission of gravitational radiation. In 
addition, several key experimental tests of general relativity have involved the effects of 
spin. By spin, we mean the macroscopic rotation of an extended body, and not the quantum- 
mechanical spin of an elementary particle. 

The motion of spinning bodies in curved spacetime has been a subject of considerable 
research for many years. This research has been aimed at discovering (i) how a body’s 
spin alters its trajectory (deviations from geodesic motion) and (ii) how a body’s motion 
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in curved spacetime alters its spin. No rigorous solution is available for these problems 
because of the difficulties in defining a center of mass of a spinning body in curved 
spacetime. The extensive literature on this subject includes work by Mathisson (1937), 
Papapetrou (1951), Corinaldesi and Papapetrou (1951), Barker and O’Connell (1974), and 
Dixon (1979). For a review of this history, see Havas (1989). 

However, in the post-Newtonian approximation, it is relatively straightforward to 
incorporate spin. In this section we sketch the method for obtaining both the equations 
of motion for the center-of-mass of each spinning body, and the evolution equations for the 
spin of each body. More details can be found in PW, chapters 9 and 13. 

The starting point is a simple and natural definition of the spin tensor (see Box 6.2), 


si = i; p* (x/vk — x9) Bx, (6.103) 


where X¥ = x — X,(9) and ¥ = v — v,,o) are the position and velocity of a fluid element 
relative to the body’s baryonic center of mass, defined by Eq. (6.35). We also introduce a 
vectorial version of the spin angular momentum, defined by 


s,= / p*x x Dax. (6.104) 


a 


It is easy to show that the tensor and vector are related by 


sig sen Set, Sik = eilogp (6.105) 


World lines for spinning bodies 


For spinning bodies, there is a basic ambiguity in the definition of the world linex, of the body. To understand 
this, we define a four-tensor spin quantity for body a by the equation 


ew) / pr (xl! — yx, (6.106) 
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a form modeled after Eq. (4.69), with the center of the coordinate system there being replaced with the 
variable x!’ which denotes the “representative world line” of the body. Here x” = x? = t,andv? = 1. Then 
it is straightforward to show that 


fv — —14 (xd — x/) ! 
Ji = SH + ama xe — vt, (6.107) 


To relate the baryonic center of mass x2 tothe world-line x/ we must constrain J? . One possibility is to require 
fi = 0,s0 that xj = x/, and the spin tensor J/*” is purely spatial in the PPN coordinate frame. But another 
possibility is to impose the “covariant” condition 


Jeg, =0, (6.108) 


where uf’ is the body’s four-velocity. This condition asserts that the spin tensor J/'” is purely spatial in the 
body's rest-frame. To lowest PN order, it can be expressed in the form 


=e (6.109) 
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One can interpolate between the two conditions using a parameter A, with fi = She leading to a 
relation between the center of mass and the world-line given to lowest PN order by 


xd = + me yk (6.110) 


Note that the difference between these world line definitions is of post-Newtonian order. The condition that 
fixes the representative world line of a spinning body is often called a “spin supplementary condition” (Barker 
and O'Connell, 1974). This is another example of the ambiguity inherent in defining the center of mass or 
the world line of an extended body. The important thing is to make a choice and to maintain that choice 
throughout all calculations. 


It is then relatively straightforward to return to Section 6.3 and redo the calculation of 
the equations of motion for a system of bodies, but now including specific finite-size terms 
in which the combination v/x* is integrated over a body, leading to a term proportional to 
sit. Only terms linear in the extent x* for any given body are kept, with the result that only 
terms linear in spins, or involving the product of spins of two different bodies are kept. It 
turns out that one of the virial relations of Eq. (6.43) is modified by a term involving spin: 
Ld ik — gy) _ 3K 4 5tp, — 2 4 326 es 7 (ne, - 5°). (6.111) 
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We also transform from the center of mass to the representative world line a ae 


Eq. (6.110) in both the acceleration daj/dt and the Newtonian term Yo, ma, [ss 
obtain the additional spin contributions to the equations of motion (dropping the oe 


bq = Aq[SO] + ag[ss] , (6.112) 


where SO and SS denote spin-orbit and spin-spin terms, given by 
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It is useful to note that, when A = 1, corresponding to the covariant spin condition, all 
reference to the velocity of the PPN coordinate system disappears. All terms depend on 
relative positions x,,, and on either relative velocities v, — v, or the velocity of a given 
body relative to the universe rest frame, w+ v, or w+ Vp. 

To obtain the equations of motion for the spin vector, we simply insert the PPN equations 
of hydrodynamics into the expression 


de = cm f pv Debs. (6.115) 
: dt 
We again expand the potentials keeping only terms linear in the variable x’. We absorb back 
into the definition of S/ a set of PN terms that involve only the internal variables of body a 
and that can be expressed as a total time derivative. Finally, we define a PN-corrected spin 
called the “proper” spin, S, according to 


1 
“Js, 5 (Ya Sa)¥a (6.116) 


This turns out to be the spin as measured in a quasi-Lorentz frame momentarily comoving 
with body a (see PW, section 9.5.7). The result is a spin evolution equation given by 
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where the first two terms depend on the internal structure of the body: 
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where the barred potentials are generated only by internal variables of the body. The 
“self” terms are present only in non-conservative theories. For a body that is spherically 
symmetric to a good approximation, the integrals will all be proportional to 64, which is 
killed by the contraction with «/”?. For a stationary axisymmetric body, the only quantities 
available to construct a two-index tensor are 6?4 and ee’, where e is a unit vector in the 
direction of the symmetry axis, and these also are killed by a contraction with €/?4. The 
strange nonconservative precessions are therefore relevant only for rather oddly shaped 
bodies. The 2, term depends on the velocity of body a relative to the preferred frame, 
and on the distribution of distant matter via the Whitehead potential. For a spherically 
symmetric body, both terms are killed by the contraction with «/?4. In addition, for 
laboratory-sized spinning bodies, such as the gyroscopes of Gravity Probe B, these terms 
are utterly negligible. 
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The final term in Eq. (6.117) is the PPN equation of spin precession, 


dS, - 
( a ) =Oq x Sa, (6.119) 
dt prec 
where (2, contains spin-orbit, spin-spin and preferred-frame contributions, given by 
OQq = Nq[SO] + Na[Ss] + Qa[PF] , (6.120) 
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The use of the proper spin ensures that this is a pure precession, in other words, the 
magnitude of the spin does not change as a result of its interaction with the other bodies in 
the system. The equations for spin evolution are independent of the parameter 4. 

It is useful to note that Q, can be expressed in the alternative form 


1 1 
4 = 527+ 1a x VU-5V x8, (6.122) 


where U = )°,(mp/ra») and g/ = go;, where go; may be read off from the PPN metric in 
Box 4.1, noting that, for a system of spinning bodies, the vector potential V; has the form 


MpVp 1 Nab X Sp 
V= pI =e + Owns) ; (6.123) 
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Equation (6.119) gives the precession of the proper spin relative to the PPN coordinate 
frame, whose axes are non-rotating relative to distant matter. We will discuss the 
observable consequences of this precession in Chapter 9. 
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With the PPN formalism and its associated equations of motion in hand, we are now ready 
to confront the gravitation theories discussed in Chapter 5 with the results of experiments. 
In this chapter, we focus on the three “classical” tests of relativistic gravity, consisting of 
(1) the deflection of light, (ii) the time delay of light, and (iii) the perihelion advance of 
Mercury. 

This usage of the term “classical” is a break with tradition. Traditionally, the term 
“classical tests” referred to the gravitational redshift experiment, the deflection of light, 
and the perihelion advance of Mercury. The reason is largely historical. These were the first 
observable effects of general relativity to be computed by Einstein, and he regarded these 
three as crucial tests of his theory. However, in Chapter 2 we saw that the gravitational 
redshift experiment is really not a test of general relativity, rather it is a test of the Einstein 
Equivalence Principle, upon which general relativity and every other metric theory of 
gravity are founded. Put differently, every metric theory of gravity automatically predicts 
the same redshift. For this reason, the the redshift effect has been dropped as a ”classical” 
test (that is not to deny its importance, of course, as our discussion in Chapter 2 points out). 

We can immediately replace it with an experiment that is as important as the other two, 
the Shapiro time delay of light. This effect is closely related to the deflection of light, 
as one might expect, since any physical mechanism in Maxwell’s equations (refraction, 
dispersion, gravity) that bends light can also be expected to delay it. In fact, it is a mystery 
why Einstein did not discover this effect. It was discovered as a prediction of Einstein’s 
theory in 1964, by radio astronomer Irwin Shapiro. The simplest explanation seems to be 
that Shapiro had the benefit of knowing that the space technology of the 1960s and 1970s 
would make feasible a measurement of a delay of the expected size (200 ys for a round 
trip signal to Mars). No such technology was known to Einstein. He was aware only of 
the known problem of Mercury’s excess perihelion advance of 43 arcseconds per century, 
and of the potential ability to measure the deflection of starlight. But the lack of available 
technology may not be the whole story. After all, Einstein derived the gravitational redshift 
at a time when the hopes of measuring it were marginal at best (a reliable measurement 
was not performed until 1960), and other workers such as Lense and Thirring and de Sitter 
derived effects of general relativity, with little or no hope of seeing them measured using 
the technology of the day. Why then, did no one at the time take the step from deflection 
to time delay, if only as a matter of principle? 

Nevertheless, despite its late arrival, the time delay deserves a place in the triumvirate of 
“classical” tests, not the least because it has given one of the most precise tests of general 
relativity to date! 
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We begin this chapter with measurements of the deflection of light, Section 7.1, then 
turn to the Shapiro time delay, Section 7.2, and in Section 7.3, we discuss the perihelion 
advance of Mercury. 


7.1 Deflection of Light 


Eq. (6.21) indicates that the path of a light ray is deflected by a distribution of mass, but it 
does not tells us how to measure the effect. We can measure the direction of an incoming 
photon from some source, but we have no information about the initial direction. In order 
to make this a testable effect we must use only quantities that can be observed directly. The 
key quantity is the angle between two sources. Then, if we can measure that angle at a time 
when the mass distribution is sufficiently far from the sources as seen on the sky that we 
can ignore any deflection, and then measure the angle when the mass is close by, we can 
measure the differential deflection. 

We can give this angle a coordinate-invariant expression. Consider an observer at rest in 
the PPN coordinate system, with four velocity uw“, who receives a signal from a target source 
and a reference source, whose trajectories are x(t) and x;'(t), respectively. The spatial 
directions of the incoming signals are given by projecting the tangent four-vectors v4 = 
dx" /dt and vi’ = dxt' /dt of the rays onto the hypersurface orthogonal to the observer’s 
four-velocity, using the projection operator P/, = 07, +u"u,,. The inner product between the 
resulting vectors is related to the cosine of the angle @ between the incoming directions by 


UP EP gs _ V-V, 
PXPEDEPy] | a) Ore) | 
If we ignore the velocity of the observer, which only produces effects such as aberration, 
which can be readily accounted for if necessary, then Eq. (7.1) simplifies to 


cos 6 = 1 — (g90) | gyuvve ve . (7.2) 


cos = (7.1) 


By substituting Eq. (6.17) into Eq. (7.2), we obtain, to post-Newtonian accuracy, 
cosO=k-k,+k-a,+hkh,-a. (7.3) 


We now consider the simple case where the source is a spherical body at the origin 
of the PPN coordinate system. Assuming that the ray is always outside the body, then in 
Eq. (6.13), Ux = —mx*(t)/r(t)?, and Eq. (6.15) becomes da/dt = —(1 + y)mb/r(t)>, 
where now b = x, — (x.-k)k. Employing the identity (6.18b) with s = x(t) and integrating 
with respect to ¢, we obtain 

a(t) = (1 + mee (= =). (7.4) 
b?\ +r Te 


Substituting Eq. (7.4) into Eq. (7.3) we obtain, to post-Newtonian order, 


b-k, -k ek 
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where xg is the location of the observer, and the subscript r denotes the reference source. 
It is useful to note, to the post-Newtonian accuracy needed in Eq. (7.5), that 


b=xo—(xo-k)k, 6b, =xo0— (xo-k,)k-. (7.6) 


We now define the angle 4 to be the angle between the unperturbed paths of the photons 
from the source and from the reference source, i.e., 


cos 6) = k-k,., (7.7) 
and we define the “deflection” of the measured angle from the unperturbed angle to be 
00=0-%. (7.8) 


There are two interesting cases to consider. This first is an idealized situation that leads 
to a simple formula. We suppose that the Sun itself is both the massive source and the 
reference source. Then the second term inside the braces never appears (since b = 0a 
priori), b- k,/b = sin @, and 


50 = l+y7)\2m (xq-k xe-k (7.9) 
2 b ro Le 


For a photon emitted from a distant star or galaxy, re > re, xXek/re ~ —l,andxg-k/re ~ 
cos 69, and we obtain (Shapiro 1967; Ward 1970), 


_ {l+y\4m 1+ cos 
v0 = ( 5 )*( 5 ie (7.10) 


The deflection is a maximum for a ray which just grazes the Sun. For #9) ~ 0,5 ~ Ro & 
6.96 x 10° km, m = me = 1.476 km, we obtain 


1 
5Omax = 1.7504 (=) arcsec . (7.11) 


For rays coming from arbitrary directions, we use the fact that b ~ rg sin 4) to express the 
deflection in the form 
50 = 1+~7)\ 2m (1+ cos 6 
— 2 re sin 09 


1 0 
= 4.072 (=) cot e mas. (7.12) 


At 90° from the direction of the Sun, the deflection is about 4 mas. 

The second case to consider is more closely related to the actual method of measuring 
the deflection of light using the techniques of radio or optical interferometry. In these 
techniques, one chooses a reference source near the observed source and monitors changes 
60 in their angular separation. If we define ® and ®,. to be the angular separation between 
the Earth-Sun direction and the unperturbed direction of photons from the two sources, as 
in Figure 7.1, then 


Xo -k Xo -k 
e cos 6, = —8@ "| 


ae) ae) 


cos ® = (7.13) 
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reference 


Geometry of light deflection measurements. 


Assuming again that the two sources are very distant and using the fact that b = rg sin® 
and b, = rg sin ®,, we obtain an expression purely in terms of measured angles on the sky: 


50 = 1+7)\ 2m [| cos ®, — cos ® cos 69 cos ® — cos ®,.cos 
7 2 (1 — cos ®) sin % (1 — cos ®,.) sin Ao 


(7.14) 


Laas) 
Note that the angle 9p is related to the angle . between the Sun-source and Sun-reference 
directions projected on the plane of the sky (Figure 7.1) by the formula from spherical 
geometry 


cos J) = cos ®,cos ® + sin ®,sin ®cos x. (7.15) 


If the observed source direction passes very near the Sun, while the reference source 
remains a decent angular distance away, we can approximate ® < ®,, and thus 


a) 
cos ®,.sin X 52 


A) ~ 6, —©® + O(®?). 7.16 
F cos y + eo (®°) (7.16) 
The resulting deflection is 
1 4 1 1+2sin* 0 
59 = (247) PT cog yp LEU EPS Osho gs o(a%)]. 7 
2 b 2 sin A 


This result shows quite clearly how the relative angular separation between two distant 
sources may vary as the lines of sight of one of them passes near the Sun (6 ~ Ro, x 
varying). 


7.1.1 Tests of the deflection of light 


The first successful measurement of the bending of light by the Sun was carried out by 
Eddington and his colleagues during the total solar eclipse of May 29, 1919. The principle 
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Why 5(1 +)? 


The deflection of light can be viewed as arising from two effects. The first, corresponding to the “1/2” part of 
the coefficient, is commonly called the “Newtonian” deflection. One way to derive this is to assume that light 
behaves as a particle, to recall that the trajectory of a particle is independent of its mass (Weak Equivalence 
Principle), and to calculate the deflection of its trajectory in the limit in which the particle's speed approaches 
the speed of light. Such an approach would have made sense in Newton's day, when light was really viewed 
asa “corpuscle,’ and, indeed, Newton himself speculated on the possible effect of gravity on light. The English 
physicist Henry Cavendish may have been the first person to calculate the bending explicitly, possibly as early 
as 1784, although evidence for this was not discovered until around 1914, during an effort to compile and 
publish his entire body of work — publication having never been high on Cavendish’ list of priorities. In fact, 
all that was found was a scrap of paper in Cavendish’s handwriting stating that he had done the calculation, 
and giving the answer. Independently of Cavendish, the Bavarian astronomer Johann von Soldner did publish 
in 1803 a detailed calculation of the Newtonian bending in a German astronomical journal. Strangely, von 
Soldner’s calculation was largely forgotten until it was resurrected in 1921 by Phillip Lenard as part of a 
campaign to discredit the “Jewish” relativity of Einstein by publicizing the earlier work of the “Aryan” von 
Soldner. Apparently, Lenard was not deterred by the fact that the 1919 measurements by Eddington actually 
favored general relativity over the Newtonian deflection. 

Unaware of the earlier work, Einstein himself derived the “Newtonian” deflection in 1911. He argued that 
the principle of equivalence requires the replacement of the Minkowski metric of flat spacetime by the metric 


is — eau ai de aigye se 


Geodesic motion for a test particle in this spacetime reproduces Newtonian gravity, and geodesic motion for 
a photon gives the Newtonian deflection. Another derivation using only the equivalence principle imagines 
a sequence of freely falling frames through which a light ray passes as it travels near a gravitating body. Each 
frame is momentarily at rest at the moment the light ray enters it. Although the path of the ray is a straight line 
within each frame, the frame picks up a downward velocity during the ray’s traversal, because of the body’s 
gravitational attraction. When the adjacent, momentarily stationary frame receives the light ray, the ray is 
deflected toward the body because the downward motion of the previous frame induces aberration on the 
received ray. By adding up all the tiny aberrations over a sequence of frames, one arrives at the Newtonian 
deflection. 

But a relativistic metric theory of gravity introduces an additional effect, because the spatial part of the 
metric now comes with the multiplying factor (1 + 2-yU). This represents spatial curvature, which could not 
be taken into account either by Newtonian gravity or by the principle of equivalence. So the total deflection 
can be viewed as a sum of a Newtonian deflection relative to locally straight lines, plus the bending of locally 
straight lines relative to straight lines at infinity. Thus in the coefficient (1 + y) /2, the “1/2” holds in any 
metric theory, while the “~y /2” varies from theory to theory. 


of the experiment was to make differential measurements. Photographs of the stars near 
the Sun taken during the eclipse were to be compared with photographs of the same stars 
taken at night some months earlier or later, and the displacement of each star compared 
to the reference photograph was to be carefully measured. But on the day of the eclipse 
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at Eddington’s site, a rainstorm started, and as the morning wore on, he began to lose all 
hope. But at the last moment, the weather began to change for the better, and when the 
partial eclipse was well advanced, the astronomers began to get a glimpse of the Sun. Of 
the sixteen photographs taken through the remaining cloud cover, only two had reliable 
images, totaling only about five stars. Nevertheless, comparison of the two eclipse plates 
with a comparison plate taken at the Oxford University telescope before the expedition 
yielded results in agreement with general relativity, corresponding to a deflection at the 
limb of the Sun (grazing ray) of 1.60 + 0.31 arcseconds, corresponding to 0.91 + 0.18 for 
(1 + y)/2. The Sobral expedition, blessed with better weather, managed to obtain eight 
usable plates showing at least seven stars each. The nineteen plates taken on a second 
telescope were deemed worthless because the telescope apparently changed its focal length 
just before totality of the eclipse, possibly as a result of heating by the Sun. Analysis of 
the good plates yielded a grazing deflection of 1.98 + 0.12 arcseconds, or 1.13 + 0.07 for 
(1 + y)/2 (Dyson et al., 1920). These results are indicated by the point and the arrow (off 
the chart) to the far left of Figure 7.2. Eddington’s announcement in November 1919 that 
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Measurements of the coefficient (1 + y) /2 from light deflection and time delay measurements. Its GR value is 
unity. The arrows at the top denote anomalously large values from early eclipse expeditions. The Shapiro 
time-delay measurements using the Cassini spacecraft yielded an agreement with GR to 10~? percent, and VLBI 
light deflection measurements have reached 0.01 percent. Hipparcos denotes the optical astrometry satellite, 
which reached 0.2 percent. 
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the bending measurements were in agreement with general relativity helped make Einstein 
an international celebrity. 

Questions were raised, however, about the results of Eddington’s measurements. Given 
the poor quality of the data, did they really support Einstein or not? Was it proper for 
Eddington to discard the data from the second telescope at the Sobral site? More recently, 
some (Earman and Glymour, 1980) have wondered whether Eddington’s enthusiasm for the 
theory of general relativity caused him to select or massage the data to get the desired result. 
Numerous reanalyses between 1923 and 1956 of the plates used by Eddington yielded the 
same results within 10 percent. In 1979, on the occasion of the centenary of Einstein’s 
birth, astronomers at the Royal Greenwich Observatory reanalysed both sets of Sobral 
plates using a modern tool called the Zeiss Ascorecord and its data reduction software 
(Harvey, 1979). The plates from the first telescope yielded virtually the same deflection 
as that obtained by Eddington’s team with the errors reduced by 40 percent. Despite the 
scale changes in the second set of Sobral plates, the analysis still gave a result 1.55 + 0.34 
arcseconds at the limb, consistent with general relativity, albeit with much larger errors. 
Looking back on Eddington’s treatment of the data, Kennefick (2009) has argued that there 
is no credible evidence of bias on his part. 

The publicity surrounding Eddington’s famous announcement has left the impression 
that his was the only test of the deflection using eclipse measurements, successful or 
otherwise. But the history is much richer (Crelinsten, 2006). William Campbell and 
Heber Curtis of the Lick Observatory analyzed plates from a 1900 eclipse in Georgia 
and a 1918 eclipse in Washington state in the United States and found no deflection; 
ironically they reported this negative result at the Royal Society of London meeting in 
July 1919 in the midst of Eddington’s data analysis (at the meeting, rumors were already 
going around that Eddington would report a positive result). Following up on Eddington’s 
success, seven teams tried the measurement during a 1922 eclipse in Australia, although 
only three succeeded. Campbell and Robert Trumpler of the Lick team reported a result 
for the deflection at the limb of 1.72 + 0.11 arcseconds, while a Canadian team and an 
England/Australian team reported values between 1.2 and 2.3 arcseconds. Later eclipse 
measurements continued to give mild support to general relativity: one in 1929, two in 
1936, one each in 1947 and 1952, and one in 1973 (light grey points and arrows in 
Figure 7.2). Surprisingly, there was very little improvement in accuracy, with different 
measurements giving values anywhere between three-quarters and one and one-third times 
the general relativistic prediction. On the other hand, there was little doubt that the Newto- 
nian deflection was wrong [for reviews, see von Kliiber (1960) and Bertotti et al. (1962)]. 

The 1973 expedition is a case in point. Organized by the University of Texas and 
Princeton University, the observation took place in June at Chinguetti Oasis in Mauritania.! 
The observers had the benefit of 1970s technology: modern photographic emulsions, a 
temperature-controlled telescope shed, sophisticated motor drives to control the direction 
of the telescope accurately, and computerized analysis of the photographs. Unfortunately, 
they couldn’t control the weather any better than Eddington. Eclipse morning brought 


' The author vividly remembers Bryce DeWitt’s slide show on scouting a site for the observations presented 
at the Les Houches Summer School on black holes in August 1972, including photos of DeWitt and Richard 
Matzner, precariously perched atop camels. 
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high winds, drifting sand, and dust too thick to see the Sun. But as totality of the 
eclipse approached, the winds died down, the dust began to settle, and the astronomers 
took a sequence of photographs during what they have described as the shortest six 
minutes of their lives. They had hoped to gather over 1000 star images, but the dust 
cut the visibility to less than 20 percent and only a disappointing 150 were obtained. 
After a follow-up expedition to the site in November to take comparison plates, the 
photographs were analyzed using the GALAXY Measuring Engine at the Greenwich 
Observatory, with a result 0.95 + 0.11 times the general relativity prediction, essentially no 
improvement in accuracy over previous eclipse measurements (Brune et al., 1976; Jones, 
1976). 

During the 2017 solar eclipse that passed over the United States, a number of amateur 
and professional astronomers endeavored to improve upon these results by exploiting new 
technologies such as CCD cameras, together with precise reference locations of stars 
provided by space astrometry telescopes, such as GAIA. Bruns (2018) achieved a three 
percent result in agreement with general relativity. 

The development of radio interferometry, and later of very-long-baseline radio interfer- 
ometry (VLBI), produced greatly improved determinations of the deflection of light. These 
techniques now have the capability of measuring angular separations and changes in angles 
to accuracies better than 100 microarcseconds. Early measurements took advantage of 
the fact that groups of strong quasars annually pass very close to the Sun (as seen from 
the Earth), such as the group 3C273, 3C279 and 3C48. As the Earth moves in its orbit, 
changing the lines of sight of the quasars relative to the Sun, the angular separation between 
pairs of quasars varies according to Eq. (7.17). A number of measurements of this kind over 
the period 1969-1975 yielded determinations of the coefficient (1 + 7), or equivalently 
of y — 1, reaching levels ofa percent. A 1995 VLBI measurement using 3C273 and 3C279 
yielded y — 1 = (—8 +34) x 1074 (Lebach et al., 1995), while a 2009 measurement using 
the VLBA targeting the same two quasars plus two other nearby radio sources yielded 
y —1=(-2 +43) x 1074 (Fomalont et al., 2009). 

In recent years, transcontinental and intercontinental VLBI observations of quasars and 
radio galaxies have been made partly to monitor the Earth’s rotation and partly to establish 
a “reference frame” for precision astrometry (“VLBI” in Figure 7.2). These measurements 
are sensitive to the milliarcsecond level deflections of light over almost the entire celestial 
sphere. A 2004 analysis of almost 2 million VLBI observations of 541 radio sources, made 
by 87 VLBI sites yielded y — 1 = (—1.7 + 4.5) x 1074 (Shapiro et al., 2004). Analyses 
that incorporated data through 2010 yielded y — 1 = (—0.8 + 1.2) x 10-4 (Lambert and 
Le Poncin-Lafitte, 2009, 2011). 

To reach high precision at optical wavelengths requires observations from space. The 
Hipparcos optical astrometry satellite making global measurements of the deflection 
yielded a result for (1 + y)/2 at the level of 0.2 percent (Froeschlé et al., 1997). 
GAIA, a high-precision astrometric orbiting telescope launched by ESA in 2013 (Gaia 
Collaboration et al., 2016) possesses astrometric capability ranging from 10 to a few 
hundred microarcseconds, plus the ability to measure the locations of a billion stars down 
to the 20th magnitude; it could eventually measure the light-deflection and y to the 10~° 
level (Mignard and Klioner, 2010). 
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71.2 Gravitational lenses 


In 1979, astronomers Dennis Walsh, Robert Carswell, and Ray Weymann discovered the 
“double quasar” Q0957+561, which consisted of two quasar images about 6 arcseconds 
apart, with almost the same redshift (¢ = 1.41) and very similar spectra (Walsh et al., 
1979). It was immediately realized that there was just one quasar, but that intervening 
matter in the form of a galaxy or a cluster of galaxies was bending the light from the 
quasar and producing two separate images. 

Since then, the “gravitational lens” has become an important tool for astronomers. 
The phenomenon has been exploited to map the distribution of mass around galaxies 
and clusters, and to search for dark matter, dark energy, compact objects, and extrasolar 
planets. Many subtopics of gravitational lensing have been developed to cover different 
astronomical realms: microlensing for the search for dim compact objects and extrasolar 
planets, the use of luminous arcs to map the distribution of mass and dark matter, and weak 
lensing to measure the properties of dark energy. Lensing has to be taken into account in 
interpreting certain aspects of the cosmic microwave background radiation, and in extract- 
ing information from gravitational waves emitted by sources at cosmological distances. 

Gravitational lensing has also yielded a remarkable measurement of (1+) /2 on galactic 
scales (Bolton et al., 2006). It used data on gravitational lensing by 15 elliptical galaxies, 
collected by the Sloan Digital Sky Survey. The mass distribution of each lensing galaxy 
(including the contribution from dark matter) was derived from the observed velocity 
dispersion of stars within the galaxy. Comparing the observed lensing with the lensing 
predicted by the deflection formula (6.20) provided a five percent bound on (1 + y)/2, in 
agreement with general relativity. Although the accuracy was only comparable to that of 
Eddington’s 1919 measurements, this test of Einstein’s light deflection was obtained on a 
galactic, rather than a solar-system scale. 


7.2 The Shapiro Time Delay 


Egs. (6.17) or (6.21) reveal that, in addition to being deflected in the gravitational field of a 
massive body, a photon is also retarded in its motion. Its coordinate speed is less than unity 
(nevertheless, the speed measured by a local freely falling observer as the photon passes 
her is always unity). We consider the simple case of a light ray passing by a spherical body 
of mass m. Solving Eq. (6.21) for t — t., and recalling that k - dx, = 0, we obtain 


(r(t) + x(t) -k)(re — Xe - 4] 


(7.18) 


t-te = () 26 + (1 49) in| p 


where b = x. — (x. - k)k. For a signal emitted from the Earth, received at a planet or 
spacecraft at x, and returned to Earth, the roundtrip travel time Af is given by 


(Te +X@ +k) (tp — Xp =] 
b : 


At = 2|xg — xp| + 2(1 + a)mn (7.19) 
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where k is the direction of the photon on its return flight. Here we have ignored the motion 
of the Earth during the round trip of the signal. To be completely correct, the round trip 
travel time should be expressed in terms of the proper time elapsed during the round trip, as 
measured by an atomic clock on Earth; but this introduces no new effects so we will not do 
so here (in actual measurements of the delay, these effects are routinely taken into account). 
The additional delay 6¢ produced by the second term in Eq. (7.19) is a maximum when the 
planet is on the far side of the Sun from the Earth (superior conjunction), that is, when 


Xg:k~re, Xp k~—ry, b~Ro, (7.20) 


then 


4 
5t=2(1 + y)m in( a) 


1 b\* 
- (=) [2851s 19.7 us | (ze) «|| , (7.21) 


where Re is the radius of the Sun, and a = 1.496 x 108 km is the astronomical unit. 

Soon after radio astronomer Irwin Shapiro discovered this effect as a theoretical 
prediction of general relativity, a program of measurements began, using radar ranging 
to targets passing through superior conjunction. The first of these was carried out by 
Shapiro’s team (Shapiro et al., 1968, 1971) using radar signals bouncing off the surface 
of Mercury and Venus. Since one does not have access to a “Newtonian” signal against 
which to compare the round trip travel time of the observed signal, it is again necessary 
to do a differential measurement of the variations in round trip travel times as the target 
passes through superior conjunction, and to look for the characteristic logarithmic behavior 
of the delay. To achieve this accurately however, one must take into account the variations 
in |xg — x,| due to the orbital motion of the target relative to the Earth. This is done 
by using radar-ranging (and possibly other) data on the target taken when it is far from 
superior conjunction (i.e., when the time-delay term is negligible) to determine an accurate 
orbit for the target, using the orbit to predict the PPN coordinate trajectory x, near superior 
conjunction, then combining that information with the trajectory of the Earth xg to 
determine all the relevant quantities in Eq. (7.19). The resulting predicted round trip travel 
times in terms of the unknown coefficient 5(1 + 7) are then fit to the measured travel 
times using the method of least squares, and an estimate obtained s(1 +). This is an 
oversimplification, of course. 

The targets employed included planets, such as Mercury or Venus, used as passive 
reflectors of the radar signals, and artificial satellites, such as Mariners 6 and 7, Voyager 2, 
the Viking Mars landers and orbiters, and the Cassini spacecraft to Saturn, used as active 
retransmitters of the radar signals. 

The results for the coefficient $(1 + y) of all radar time-delay measurements performed 
to date are shown in the bottom half of Figure 7.2. The 1976 Viking experiment resulted in 
a 0.1 percent measurement (Reasenberg et al., 1979). 

A significant improvement was reported in 2003 from Doppler tracking of the Cassini 
spacecraft while it was on its way to Saturn (Bertotti et al., 2003), with a result y — 1 = 
(2.1 + 2.3) x 10~°. This was made possible by the ability to do Doppler measurements 
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using both X-band (7175 MHz) and Ka-band (34316 MHz) radar, thereby significantly 
reducing the dispersive effects of the solar corona. Note that with Doppler measurements, 
one is essentially measuring the time derivative of the Shapiro delay. In addition, the 
2002 superior conjunction of Cassini was particularly favorable: with the spacecraft at 
8.43 astronomical units from the Sun, the distance of closest approach of the radar signals 
to the Sun was only 1.6 Ro. For scalar-tensor theories, the Cassini bound places a limit 
wo > 40,000 or ay < 3.5 x 1073. For the case of massive scalar fields, Alsing et al. 
(2012) displayed the bounds as a function of the mass. Improved measurements of the 
Shapiro delay, down to the level of parts per million, may be possible using data from the 
cruise phase of the joint European-Japanese BepiColombo project to place two orbiters 
around Mercury (Benkhoff et al., 2010), scheduled for launch in late 2018 (Imperi and 
Tess, 2017). 

The Shapiro delay has been measured and plays an important role in analyzing binary 
pulsar data, most notably from the “double pulsar” JO737-3039 and even, recently, from 
the original Hulse-Taylor binary pulsar B1913+16 (see Chapter 12). In combination with 
the geometric delay caused by light deflection, the Shapiro delay also plays a role in efforts 
to measure the Hubble constant using time delays between images in gravitational lenses 
(see Meylan et al. (2006) for recent reviews). 


7.3 The Perihelion Advance of Mercury 
SCC 


The explanation of the anomalous perihelion shift of Mercury’s orbit was an early triumph 
of GR. Since Le Verrier’s 1859 announcement of an unexplained advance in the perihelion 
of Mercury after the perturbing effects of the other planets had been accounted for, no 
credible explanation had been found, despite numerous attempts. The modern value for 
this discrepancy is about 43 arcseconds per century (see Table 7.1). General relativity 


Table 7.1 Planetary contributions to Mercury's 


perihelion advance (in arcseconds per century). 


Planet Advance 
Venus 277.8 
Earth 90.0 
Mars DES 
Jupiter 153.6 
Saturn a) 
Total S32 
Observed 574.1 


Discrepancy 42.9 
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accounted for the anomalous shift in a natural way without disturbing the agreement with 
other planetary observations. 

In Chapter 6, Eq. (6.78) we obtained the secular changes in the orbit elements of a 
two-body system in the PPN framework, including the effects of a Newtonian quadrupole 
moment for one of the bodies (but ignoring preferred-frame or preferred location effects 
for the moment). Since the orbit of Mercury is inclined by only 7° relative to the ecliptic 
plane of the solar system, and since changes in its perihelion are measured relative 
to the fixed reference system of the ecliptic plane, the relevant observable quantity is 
Aw = Aw + costAQ. From Eq. (6.78), with  ~ 0, we obtain 


1 
3 


Kes 67m 
Dp 


where now J and R are the quadrupole moment and radius of the Sun, and m = me + mys. 


1 30 (RY? 
(2+ 27— 8) + En (21 — a2 + a3 +20)| Pole (=) , 722) 


We have ignored a small additional quadrupole contribution caused by the fact that the 
Sun’s rotation axis is inclined relative to the ecliptic by 6 ~ 7°; this adds a correction to 
the quadrupole term that is too small to be measurable at present. 

The first term in Eq. (7.22) is the “classical” perihelion shift, which depends upon the 
PPN parameters y and (. The second term depends upon the dimensionless reduced mass 
of the two bodies; it is zero in any fully conservative theory of gravity (aj = a2 = a3 = 
C2 = 0); it is also negligible for Mercury, since 7 ~ mes /Me ~ 2x 1077. We will drop this 
term henceforth. The third term depends upon the solar quadrupole moment J2. For a Sun 
that rotates uniformly with its observed surface angular velocity, so that the quadrupole 
moment is produced by centrifugal flattening, one may estimate J) ~ 107’. This actually 
agrees reasonably well with values inferred from rotating solar models that are in accord 
with observations of the normal modes of solar oscillations (helioseismology); the latest 
inversions of helioseismology data give Jy = (2.2 + 0.1) x 1077 (Mecheri et al., 2004; 
Antia et al., 2008); for a review of measurements of the solar quadrupole moment, see 
(Rozelot and Damiani, 2011). Substituting standard orbital elements and physical constants 
for Mercury and the Sun we obtain the rate of perihelion shift da /dt, in seconds of arc per 
century, 


dw 1 —4 Jy 
— = 42.980 E (2+2y— 8B) +65 x 10 =) (7.23) 


(see Nobili and Will (1986) for a discussion of how the precise numerical value of the GR 
prediction is obtained). 

In the current method of testing gravitational theories using the dynamics of Mercury, 
the actual value of the predicted advance is somewhat irrelevant. Instead, the equations of 
motion used in all ephemeris computer codes to predict the orbits of Mercury and of all the 
planets are the PPpN N-body equations (6.46c). Those equations contain all the effects needed 
to determine the total advance of the perihelion of Mercury, including both the effects of 
the other planets and post-Newtonian effects. The modeling of the orbits depends on a host 
of parameters. These include masses and orbit elements at a chosen epoch of all the planets, 
of the Moon and many of the other moons of the planets, and of many asteroids. The effects 
of solar wind and radiation pressure on spacecraft must be modeled. Parameters associated 
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The solar oblateness controversy 


During the 1960s, Robert Dicke and Mark Goldenberg attempted to determine J, by measuring the Sun’s visual 
shape. Because the surface of the Sun is an equipotential, its shape is affected by J) in a way that can be 
directly related to the deformation of the external gravitational field. The shape was measured by inserting 
a circular, opaque disk in front of a telescopic image of the Sun, leaving only a thin visible ring at the edge 
of the Sun, and measuring the difference in brightness of the visible ring between the equator and pole of 
the Sun. If the Sun were oblate, the ring at the equator should extend further beyond the occulting disk, and 
should therefore be brighter. But many factors had to be corrected for, including the effects of atmospheric 
distortion on the observed shape of the Sun, and the effects of possible temperature differences between the 
polar and equatorial regions of the Sun, which would lead to brightness differences not associated with the 
shape. Dicke and Goldenberg claimed to have measured aJ, of the order of 2.5 x 10~°, over 100 times larger 
than the currently accepted value. Dicke postulated that such a large oblateness would occur if the core of 
the Sun were rotating much faster than its outer layers, thereby generating more centrifugal flattening than 
would be expected on the basis of the observed surface rotation alone (see Dicke and Goldenberg (1974) for a 
review). 

A value of J; this large would mean that solar oblateness contributes as much as 4 arcseconds per century 
to Mercury's perihelion advance, which would destroy the agreement of the measured advance with the 
prediction of general relativity. But it would have supported the Brans-Dicke theory of gravity if wap were 
chosen to be around 5, a value favored by Dicke for other reasons. Later observations of the visible shape of the 
Sun by Henry Hill and others, along with observations to try to better understand the temperature differences, 
did not fully resolve this controversy. 

The resolution came with the advance of helioseismology. This was the discovery that the Sun vibrates 
in a superposition of thousands of normal modes with an array of frequencies, as could be observed by 
measuring the frequency spectrum of Doppler-shifted solar spectral lines. The specific pattern of frequencies 
depends on the Sun's angular-velocity profile. Through a systematic program of ground-based and space- 
based observations of the Sun, it became possible to determine the Sun's rotational profile over much 
of its interior. The conclusion was that the core does not rotate much faster than the surface, and solar 
models consistent with this information produced the currently accepted value of J) = 2.2 x 10~’; this 
is approximately what one would infer from a Sun that rotates uniformly at its observed surface rate. 


with the effects of the interplanetary plasma and the Earth’s atmosphere on the propagation 
of the tracking signals, and with variations in locations of radar tracking stations caused 
by Earth rotation changes and tidal effects are included. Finally, the PPN parameters are 
included. The relevant data include centuries of optical observations of planetary motions 
on the sky, radar measurements of planets and spacecraft, and laser ranging to the Moon. 
Predicted orbits are compared with the data, and the parameters are adjusted to obtain 
a best fit in the least-squares sense. The results include an estimate for the PPN parameter 
combination (2 +27 — 3)/3, along with an associated error. Some data sets are particularly 
sensitive to the Shapiro time delay, and these provide accurate estimates of (1 + y)/2. 
Lunar laser ranging data are sensitive to the Nordtvedt effect (to be discussed in Chapter 8), 
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and improve the parameters of the Earth’s orbit about the Earth-Moon barycenter. As the 
data improve, either from more accurate range or Doppler measurements, or from new 
satellite missions, particularly planetary flyby and orbiter missions, the estimates for many 
parameters may change and the errors may decrease. 

By the end of the 1970s, measurements of the factor (2 + 27 — 3)/3 had attained 
precisions of roughly 0.5 percent, through analyses of three centuries of optical obser- 
vations of Mercury’s orbit, and of radar measurements since the middle 1960s both to 
Mercury’s surface and to the Mariner 10 spacecraft, which had close encounters with 
Mercury in 1974 and 1975. These estimates continued to improve somewhat during the 
1980s and 1990s through the steady accumulation of data. However, a major advance 
in measuring the perihelion advance was made by exploiting Mercury MESSENGER. In 
2011, MESSENGER became the first spacecraft to orbit Mercury, and range and Doppler 
measurements of the orbiter were made until the spacecraft ended its mission in 2015 
with a controlled crash on the surface of Mercury. By 2013, MESSENGER data had 
already led to dramatically improved knowledge of Mercury’s orbit. One analysis of all 
the available data yielded bounds on ¥ and 6 given by y — 1 = (—0.3 2.5) x 107° and 
B-1 = (0.2+2.5) x 10~> (Fienga et al., 2011; Verma et al., 2014; Fienga et al., 2015). The 
bound on 7 is consistent with that obtained from the dedicated test of the Shapiro time delay 
using Cassini (Section 7.2). The analysis also yielded an estimate Jy = (2.4+0.2) x 1077, 
consistent with the results from helioseismology. Taking into account the errors in y and 3 
at the level of 2.5 x 10~°, this corresponds to an effective measurement of the relativistic 
perihelion advance of Mercury at 42.980 + 0.001 arcseconds per century. This precision 
could be improved using future data from BepiColombo (Milani et al., 2002; Ashby et al., 
2007). 

A slightly weaker bound 8 — 1 = (0.4 + 2.4) x 10~* from the perihelion advance of 
Mars (adopting the Cassini bound on ¥ a priori) was obtained by exploiting data from 
the Mars Reconnaissance Orbiter (Konopliv et al., 2011). Laser tracking of the Earth- 
orbiting satellite LAGEOS II led to a measurement of its relativistic perigee precession 
(3.3 arcseconds per year) in agreement with general relativity to 2 percent (Lucchesi and 
Peron, 2010, 2014). 


Tests of the Strong Equivalence Principle 


The next class of experiments that test relativistic gravitational effects may be called tests 
of the Strong Equivalence Principle (SEP). That principle states that (i) WEP is valid for 
self-gravitating bodies as well as for test bodies (GWEP), (ii) the outcome of any local 
test experiment, gravitational or nongravitational, is independent of the velocity of the 
freely falling apparatus, and (iii) the outcome of any local test experiment is independent 
of where and when in the universe it is performed. In Section 3.3, we pointed out that 
many metric theories of gravity (perhaps all except general relativity) can be expected to 
violate one or more aspects of SEP. In Chapter 6, working within the PPN framework, we 
saw explicit evidence of some of these violations: violations of GWEP in the equations of 
motion for massive self-gravitating bodies, Eq. (6.46b); preferred-frame effects in the N- 
body equations of motion, Eq. (6.46c); and preferred-frame and preferred-location effects 
in the locally measured gravitational constant, Eq. (6.100). 

This chapter is devoted to the study of some of the observable consequences of such 
violations of SEP, and to the experiments that test for them. In Section 8.1, we consider 
violations of GWEP (the Nordtvedt effect), and its primary experimental test via Lunar 
laser ranging. Section 8.2 focuses on preferred-frame and preferred-location effects on the 
orbital motions of planets and binary systems, while Section 8.3 analyzes such effects on 
the structure of self-gravitating bodies. Another violation of SEP would be the variation 
with time of the gravitational constant G as a result of cosmic evolution. Tests of such a 
variation are described in Section 8.5. 


8.1 The Nordtvedt Effect 
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The breakdown in the Weak Equivalence Principle for massive, self-gravitating bodies 
(GWEP), which many metric theories predict, has a variety of observable consequences. 
In Chapter 6, we saw that this violation could be expressed in quasi-Newtonian language 
by attributing to each massive body inertial and passive gravitational mass tensors im’ and 
rn, which may differ from each other. The quasi-Newtonian part of the body’s acceleration 
may then be written [see Eq. (6.54)] 


(71) J (ah )Newt = (7p) , (8.1) 


where {(!”" is a quasi-Newtonian gravitational potential, and the inertial and passive mass 
~ jk «IK ri 
tensors, mj" and mp, are given by 
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where Q, and Q2* are the body’s internal gravitational energy and gravitational energy 
tensor (see Box 6.2 for definitions), and m, is the total mass energy of the body. Most 
self-gravitating bodies (planets, stars) are very nearly spherically symmetric, so we may 
approximate 


F 1 : 
OF ~ ee (8.3) 
With these approximations, we write the quasi-Newtonian equation (8.1) in the form 
ay = (=) Vil, (8.4) 
Ma 


where 


10 2 2 1,\ Q, 
(=) =14 (49 y-3 g Cay 361 a) , 


ay (8.5) 


r, 
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The most important consequence of the Nordtvedt effect is a polarization of the orbit ofa 
binary system in the presence of a third body. If the self-gravitational energy per unit mass 
of one member of the binary is larger than that of its companion, then the two bodies may 
fall toward the distant third body with slightly different accelerations, leading to the orbital 
polarization. The most notable example is that of the Earth-Moon orbit in the presence 
of the Sun (Nordtvedt, 1968c). Other examples include binary systems (especially binary 
pulsars) in the presence of the galaxy, and the recently discovered pulsar in a triple system, 
consisting of a neutron star and two white-dwarf companions. 

Even in pure Newtonian gravity, the three-body problem is notoriously difficult. 
However, if the system is “hierarchical,” meaning that the average separation of one pair 
of bodies is small compared to the distance between that pair and the third body, then the 
problem is amenable to an expansion in terms of the small parameter r/R where r and R 
are the two separations (in some situations, a better expansion parameter is the ratio of the 
period of the inner binary to that of the third body, P;,/P3). In hierarchical triple systems, 
the inner binary can be treated as a Keplerian two-body orbit to lowest order, with the third 
body producing small perturbations. 

Using our quasi-Newtonian equation of motion (8.4), we obtain, for a two-body system 
in the presence of a third body 


Mp X12 X13 
a=— (=) (mae + (ma) | , 
m/\ Mp r 


13 


t= (2), [ln — (m2 (6) 
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We denote the barycenter position of the two-body system by 
i= aide + ee (8.7) 
m m 


where m = m, + m2. Assuming that rj2§ < ro3, we expand 


J J 
x x 1 1 1 

13 _ “%c3 k k om 3 5 

3. 3. Ie ( ) 5 le*le (=) + O(7;-/Te3) » (8.8) 
3 3 Pc3 jk 3S ikm 


where x.3 = X- — X3 and x}. = x, — X-; we expand oy / ves in a similar fashion. We define 
R=re = |\xa3|,N =X3/Tea, Me = Xe/Nies We = X2¢/T2, and notice that 


Tic =(m2/m)r, roe = (mm /m)r, = me = —Mre = Vn, (8.9) 


where r = |rj2| and a = x12/ri2. With this notation we find that the relative acceleration 
a = a, — az 1s given by 


m* 


m , m3r 
ar a baN ta ar BNW: 2) —n] 
= 3 as SMW: n) —2n(N-n)—N)], (8.10) 


where m* is defined by Eq. (6.71), we have dropped the A subscript on m3, and we have 
defined 


eo Hat 5A, 
A* = Aa + 56(1—2n), (8.11) 
where 
a= ; (ee) + ‘oa ‘ (8.12a) 
1=(%)-@), en 
co * 8.12 
= ee (8.12c) 


The first term in Eq. (8.10) is the standard Newtonian acceleration; since m* represents the 
Kepler-measured mass of the two-body system we will replace it by m. The second term 
is a relative acceleration that stretches or shrinks the orbit along a line directed toward the 
third body—this is the Nordtvedt effect. From Eq. (8.5) we have that 


10 2 2 1 Go © 
b= (48 sa ee ea eS 56) ( : 2) (8.13) 
mM, m2 


The third and fourth terms originate from the Newtonian tidal interactions with the third 
body, and they typically give rise to orbital perturbations that are substantially larger than 
the nominal size of the Nordtvedt effect. 

We now consider a simplified situation in which the inner and outer orbits lie in the 
same plane (for the Earth-Moon-Sun system, their actual relative inclination is about 5°), 
the inner orbit is approximately circular (the eccentricity of the Moon-Earth orbit is 0.055) 
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and the outer orbit is circular (the eccentricity of the Earth-Sun orbit is 0.017). We calculate 
the first-order perturbations of the inner orbit created by the last three terms in Eq. (8.10). 
In the limit of small eccentricity, we can write for the inner binary ddé/dt = wy, ~ constant, 
and for the outer binary, dé/dt = w; = constant, so that 


r= a(1—ecosf) + O(e’), 

f=o-—w=uy,t—w+ Ole), 

n = eycos (wot) + ey sin (wpt) + O(e), 

A = —eysin (wyt) + ey cos (wet) + O(e), 

N = eycos (w3t — ®) + eysin (w3t — ®), (8.14) 


where e is the eccentricity of the inner orbit, and ©® is the initial phase of the third body. 
Note that 


n-N= cos(At+®), 
A. N= -—sin(At+ ®), (8.15) 
where A = wy — v3 
We now use the Lagrange planetary equations reviewed in Section 6.4.2. This must be 
done with some care, because the orbital eccentricity is small, and w loses its meaning when 


e —> 0. In such cases it is best to use the alternative variables A = ecosw and B = esinw, 
and to reexpress the Lagrange planetary in terms of A and B. We therefore write 


r = all —Acos (wot) — B sin (wst)] + O(e), (8.16) 


and convert Eqs. (6.74) to the form 


UF a * 59.612). (8.17) 
dt Wh 

dA ole... 

= aal® sin (wpt) + 2S cos (ws#)| +O(e), (8.18) 
dB 

SS —|—Rcos (wyt) + 2S sin (ws0)| + O(e), (8.19) 
dt Wpp 


where R and S are the radial and tangential components of the perturbing acceleration, 
and we have used the fact that (p/m)!/? = (wyp)~! + O(e?). To lowest order in e, R and 
S are given by 


R= Bala [3 cos*(At + &) — 1] — se cos (At + &)[5 cos*(At + ®) — 3] 
R 
- ra cos (At + »)} ; (8.20) 


S= Bal —30" sin (At + ®) cos (At+ ®) + = sin (At + ®)[5 cos*(At + ®) — 1] 


+ A 5 sin (At + o}. (8.21) 
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where ws = m;/R°. Because the orbital planes are taken to coincide, there are no 
perturbations of the inclination z or line of nodes 2. 

Substituting Eq. (8.21) into Eq. (8.19), integrating subject to the boundary conditions 
a =a, A = Ao, B = Bo at t = 0, and inserting the results into Eq. (8.16), we obtain 


r(t) =a’ [1 — A’ cos (w,t) — B’ sin (wot)| 


| w(2) [3 wp (1+ 5/A) 


2(At+ ® 
= 5 aae cos 2(At + ®) 
1 Q0 ax 3? (3 + 2wy/A) 
aR ae cos (At + ®) 
Swe (3 + 2w,/A) 
we 9M? cos 3(At+ ®) 
R w2(1 + 2u,/A 
} w| a ul cos (ar ).f, (8.22) 
ao w,—A 


where a’, A’ and B’ differ from ap, Ao and By by small corrections induced by the 
perturbations, and represent unobservable constant shifts in the orbit elements. The 
remaining terms depend on the difference A = uw, — w3, and thus have a very different 
time dependence from that of the eccentric orbit. Notice that, because we have a sinusoidal 
driving force acting on a sinusoidal oscillator, the solutions take the form of a resonant 
response, with denominators of the form 1 /(w; —N* A’), with N representing the harmonic 
degree. If N = 1, then, if w; < wy», the resonant factor becomes w7/(w; — A?) & 
wp /2w3 >> 1, leading to an enhanced amplitude. 

We first apply these results to the Earth-Moon system, with the Sun making up the third 
body. We set mz =mg,m, =m = mg /81.3,m3 = Mo, Ws = we , the angular velocity 
of the Moon around the Earth, and w3; = wg, the angular velocity of the Earth around the 
Sun, with we © 13.37wg = 2.66 x 10~° s~!. In the Earth-Moon system, A is known as 
the synodic frequency, the angular frequency of the lunar orbit relative to the Sun. This is 
to be distinguished from the sidereal frequency we , which refers to the global coordinate 
frame. We also have 


Q Q 
oS s2aguto-, “=p 9e10-", (8.23) 
Ng mE 


and R/ay ~ 390. Inserting the relevant numbers for the Earth-Moon system, setting 
5 ~ 44 x 107!°, and taking into account the fact that aj ~ 3.84 x 10° km, we see 
that the three tidal perturbations and the Nordtvedt effect have approximate amplitudes 
of 2770 km (2A), 3 km (3A), 73 km (A), and 8 meters (A), respectively. Two of the 
tidal perturbations occur at twice and three times the synodic frequency, whereas the 
Nordtvedt effect occurs at the synodic frequency; the tidal perturbations are much larger, 
but they can be cleanly separated from the Nordtvedt effect by observing many lunar 
orbits. The third tidal perturbation contributes at frequency A with an amplitude of 73 km. 
However, because this perturbation depends on parameters (we, we , R, ao, A) that are 
very accurately measured by other means, it can be predicted in advance to an accuracy 
well below the amplitude of the Nordtvedt effect. 
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In fact, an accurate calculation of the relevant amplitudes would require us to go beyond 
the first-order perturbation analysis carried out here. The perturbations induced by each 
term affect the behavior of the other terms, and so it is necessary to go to higher order in the 
orbital perturbation equations, and to include higher values of the multipole index ¢. The 
problem is that the solar perturbation of the lunar orbit is so large, as seen by the 2770 km 
leading tidal amplitude, that one must employ more sophisticated techniques, such as the 
Hill-Brown lunar theory, in order to find a sequence of perturbations that converges in a 
reasonable way. The final conclusion of such calculations is that the effective amplitude 
of the Nordtvedt term is increased from the amplitude shown in Eq. (8.22) by a factor 
approximately equal to 1 + 2wg/wg ~ 1.15, leading to an amplitude of 9.2 meters. 
Similarly, the amplitude of the competing synodic term is increased from 73 km to 110 
km, but it can still be predicted accurately enough to be subtracted from the data. 

The resulting prediction for the Nordtvedt effect is 


Or(t) ~ 9.2 ny cos (At + ©) m, (8.24) 


where 7y is the Nordtvedt parameter 
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As we explain in Box 8.1, long-term monitoring of the lunar orbit has revealed no sign of 
a Nordtvedt effect, and ny is currently constrained to be smaller than about 4 x 107+. 

Tests of the Nordtvedt effect for neutron stars have also been carried out using a class of 
systems known as “wide-orbit binary millisecond pulsars” (WBMSP). These are pulsar- 
white-dwarf binaries with orbital periods ranging from 5 to 800 days, and eccentricities 
ranging from 10~* to 10~°. In the gravitational field of the galaxy, a non-zero Nordtvedt 
effect can induce an anomalous eccentricity pointed toward the galactic center (Damour 
and Schaefer, 1991). This can be bounded using statistical methods, given a sufficient 
number of WBMSPs. The current bound is |nv(Q/m)ns| < 4.6 x 10-3, from data on more 
than 20 WBMSPs, including some highly circular systems (Stairs et al., 2005; Gonzalez 
et al., 2011). Because (Q./m)ns ~ 0.1 for typical neutron stars, this bound on ny does 
not compete with the bound from LLR; on the other hand, the presence of neutron stars 
implies that these systems test the Strong Equivalence Principle in the strong-field regime. 
Freire et al. (2012) discuss ways to carry out this test in WBMSP systems without relying 
on statistical averages over many systems. 

The remarkable pulsar in a triple system J0337+1715, reported by Ransom et al. (2014), 
may provide a test of the Nordtvedt effect that supercedes that from LLR. The system 
consists of a millisecond pulsar in a 1.6 day orbit with a white-dwarf companion of 0.2Mo, 
with a more distant 0.4Mo white dwarf in a 327 day orbit around the pair. The two orbits 
are almost perfectly coplanar and highly circular. The key parameters of the system are 
shown in Table 8.1. Inserting those parameters into Eq. (8.22), we find that the three tidal 
perturbations have amplitudes 2.2 x 10° km (2A), 2 x 10° km (3A) and 1.1 x 10* km (A), 
while the Nordtvedt effect (A) has the amplitude 


Q -Q 
dry ~ 354km (22) ( im — aime (8.26) 
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Lunar laser ranging and the Nordtvedt effect 


In the late 1950s, Dicke wondered whether one could measure a variation with time of Newton’s constant G, a 
feature that he would soon incorporate into Brans-Dicke theory. By the early 1960s, the development of pulsed 
ruby lasers and the rapid build-up of the lunar space program led him and others to propose making very 
accurate measurements of the Earth—Moon distance by bouncing laser pulses off specially designed reflectors, 
to be placed on the lunar surface by either unmanned or manned landers. Such measurements would provide 
tests of general relativity, but they would also have other important scientific benefits, such as improving our 
understanding of the Earth—Moon orbit, the librations of the Moon, and even the motions of the Earth-bound 
laser sources resulting from tidal motions and continental drift. 

The first retroreflector was deployed on the Moon by US Apollo 11 astronaut Neil Armstrong on July 21, 1969, 
and within a month, the first successful acquisition was made of a reflected laser signal. Two other US and two 
French-built reflectors were subsequently placed on the Moon by US astronauts and Soviet unmanned landers. 
Strangely, the French reflectors were never detected via laser bounces, until just recently. 

Since that time a worldwide network of observatories has made regular measurements of the round-trip 
travel time to the three US lunar retroreflectors, with accuracies that are routinely at the 50 ps (1 cm) level, 
and that are approaching 5 ps (1 mm). These measurements are fit using the method of least-squares to a 
theoretical model for the lunar motion that takes into account perturbations created by the Sun and the other 
planets, tidal interactions, and post-Newtonian gravitational effects. The predicted round-trip travel times 
between retroreflector and telescope also take into account the librations of the Moon, the orientation of the 
Earth, the location of the observatories, and atmospheric effects on the signal propagation. The Nordtvedt 
parameter is then estimated in the least-squares fit, along with several other important parameters of the 
model. 

From the first published analyses of lunar laser ranging (LLR) data in 1976 to the present, there has 
been absolutely no evidence, within experimental uncertainty, for the Nordtvedt effect. The residual orbital 
perturbation is Or(t) = (2.8 + 4.1) mm cos (At + ®). 

At this level of precision, however, we can't regard the results of LLR as a completely “clean” test of 
the Nordtvedt effect until we consider the possibility of a compensating violation of the Weak Equivalence 
Principle for the Earth and Moon. This is because the chemical compositions of the Earth and Moon differ: 
the Earth is richer in the iron group elements, while the Moon is richer in silicates. To address this issue, the 
E6t-Wash group at the University of Washington in Seattle carried out a novel torsion-balance test of WEP 
by fabricating laboratory bodies whose chemical compositions mimic that of the Earth and Moon. They found 
that the mini-Earth and mini-Moon fell with the same acceleration to 1.4 parts in 10" (BaeBler et al., 1999; 
Adelberger, 2001). Combining this limit with the results for dr(t) led to a bound on the Nordtvedt parameter 


my = (4444.5) x 1074. 


This represents a limit on a possible violation of the Strong Equivalence Principle of about 2 parts in 10’ 
(Williams et al., 2004b, 2009; Hofmann et al., 2010). 

The Apache Point Observatory for Lunar Laser ranging Operation (APOLLO) project, a joint effort by 
researchers from the University of Washington, Seattle, and the University of California, San Diego, is using 
enhanced laser and telescope technology, together with a good, high-altitude site in New Mexico, hoping to 
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improve the LLR bound by as much as an order of magnitude (Murphy et al., 2012). Strangely, the current 
limitation is not the laser ranging itself (which routinely reaches millimeter levels of precision) but the fact 
that many of the large ephemeris codes used to analyze the data have not kept pace with the improved data on 
solar-system dynamics. This effort will be aided by the fortuitous 2010 discovery by the Lunar Reconnaissance 
Orbiter of the precise landing site of the Soviet Lunokhod | rover, which deployed a French retroreflector in 
1970. Its uncertain location made it effectively lost to lunar laser ranging for almost 40 years. Its location on 
the lunar surface will make it useful in improving models of the lunar libration (Murphy et al., 2011). 

Lunar laser ranging also yielded important bounds on the pPn parameters a; and a and on a time 
variation of G (see Sections 8.4 and 8.5). 


Table 8.1 Parameters of the pulsar J0337-++1715 triple system. 


Numbers in parentheses denote errors in the last digit. 


Parameter Value 
Pulsar mass 1.438(1)Mo 
Inner companion mass 0.1975(2)Mo 
Outer companion mass 0.4101(3)Mo 


Inner orbit 


Semimajor axis 1.9242(4) Is 

Orbital period 1.629401788(5) d 
Eccentricity 6.9178(2) x 1074 
Inclination IIASA” 

Outer orbit 

Semimajor axis 118.04(3) Is 

Orbital period 327.257541(7) d 
Eccentricity 3.5356196(2) x 1077 
Inclination 39.24(1)° 


where we have scaled 1y by the approximate current bound from LLR, and the self- 
gravitational energy difference by 0.1, a typical value for neutron stars in general relativity 
(compare with the parts in 10'° for the Earth-Moon system and parts in 10+ for the 
companion white dwarf). With timing residuals for the pulsar at the level of a microsecond, 
corresponding to errors in dr of order 0.3 km, the bound on 7, could potentially be 
improved by three orders of magnitude. This estimate is consistent with results of a 
simulation by Shao (2016).! 


' In spring 2018, Archibald et al. (Nature, in press) reported an upper limit on the difference in acceleration 
between the neutron star and white dwarf of 2.6 parts in 10°. 
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However, because neutron stars involve strong gravity, post-Newtonian theory does not 
necessarily give a good approximation to the consequences of SEP violations, so this 
discussion of WBMSPs and the pulsar triple system should be regarded as only qualitative. 
In Chapter 10, we will introduce a framework for treating the post-Newtonian motion of 
systems containing compact bodies (neutron stars and and black holes). 


8.2 Preferred Frames and Locations: Orbits 


There are a number of observable effects of a preferred-frame and preferred-location type 
in the orbital motions of bodies governed by the pPpN N-body equation of motion (6.45). To 
determine these effects, we consider a two-body system whose barycenter moves relative 
to the universe rest frame with velocity w, and that resides in the gravitational potential 
Uext of a distant body (the galaxy will prove to be the dominant such body). In the N- 
body equations of motion (6.45), we will ignore all the self-acceleration terms except 
the term in Eq. (6.46a) that depends on a3 and w. We will also ignore the Newtonian 
acceleration, the Nordtvedt terms, and all the post-Newtonian terms that were included in 
the classical perihelion-shift calculation. Thus, from Eqs. (6.46a), (6.46c), and (6.50) we 
have the additional accelerations 


1 1 
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_ 5 . [ayy _ (ay _ 2a2)Vv2 + 202W] Ww, 
ag = {1=2;n—> —-n}, (8.27) 
where n = X12/r, rg = |xXic|, and ng = XxiG/rig. In obtaining Eq. (8.27), we have 


ignored terms of order mgr/rz,, mgr*/ri, and so on. The first two terms inside the braces 
in Eq. (8.27) are constant, therefore they can simply be absorbed into the Newtonian 
acceleration by redefining the gravitational constant (they are related to the constant 
corrections to Gy in Eq. (6.100)). 

We will defer a discussion of the term proportional to a3); /m;(w+¥1) x wy, to later in 
this section. 

The relative acceleration a = a, — a is given by 


mx | 1 3 2 mx | 1 
a= a [padwevt 702(w nt) ae [sade aan | w 
+ ene [2ng(ng -n) —3n(ng- n)’| : (8.28) 
i re 


where we have made use of Eqs. (6.67) and (6.68). 
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Following the method outlined in Section 6.4.2, we calculate the secular changes in the 
orbit elements resulting from these perturbations. Using Eq. (6.73), we define 


ep =N|g=u = en cosw+e, sinw, 


€9 = Alg=aw = —eq Sinw +e cosw, 


h = ep X eg =en Xe1, (8.29) 
where ep is a unit vector pointing toward the pericenter (it is the Runge-Lenz vector divided 
by the orbital eccentricity) and eg = hx ep; eg is a unit vector pointing along the ascending 
node (see Figure 6.1), ande, = h x eg. For any vector A, we then define components Ap, 
Ag, An, Aq, and A, accordingly. 

The secular changes in the orbit elements are then given by 


Aa=0, 
m\ 2 

Ae = —7ra, A (7) wp(1 — e”)F(e) + 2ma2wpwoev 1 — e2F(e)? 
P 


m 
— 4n€—nepngge 1— eF(e) ; 
'G 


1/2 [7 2 
Aw = —7a,\A (“) woo — Ta (wp — wo) F(e) 
Dp e 


m 
+ ang (ne — Neg) F(e)? ; 


1/2 
Au =TajA (7) wp sin (w)eF(e) — 2magwywe 
P —e 
mG Fie) 
+ 4r€ —ngtcgk ==. , 
ore ee 
. m\ 1? (e) 
sintAQ = —7ra,A (“) wn cos (w)eF(e) — 2702WaWs : 
P =e" 
mG F(e) 
+ 4r€ —ngyngs ————_ _., 8.30 
. 7g oNnes Aa (8.30) 
where 
1 
ec) = SSS 8.31 
) ite (8.31) 
and for any vector A, 
AR = Ap cos (w) — Ag sin (w) V1 — e?, 
Ag = Ap sin (w) + Ag cos (w)V1 — e?. (8.32) 
In the small-eccentricity limit, to first order in e, these results simplify to 
Aa=0, (8.33a) 
T my 1/2 T m 
Ae = —zuA (=) Wp + 7 c2wpwee _ mE“ nepnane (8.33b) 
T m\'/2 we 3 F TMG / 9 F 
Ag =—LaiA ( ‘ ) pT 702 WR MB) + FETE (nee — Bq) » (8.330) 
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1/2 
A= Suk (=) whe sin (w) — Ta2WAWE + nee nganca : (8.33d) 
a 'G 
1/2 
sintAQ = —FaiA (=) wne cos (w) — Ta2wyw_ + ne tains, , (8.33e) 
a 'G 


where, for e = 0, Eq. (8.32) implies that Az = Ag, and As = A,. 

It is tempting to interpret these secular changes as implying linearly growing values 
of the orbital elements. However, the expressions in Eq. (8.33) depend on w, both from 
explicit w dependence, and via the P and Q components of w and ng. The pericenter angle 
is actually advancing at an average rate dw/dd = w' ~ 3m/p (see Eq. (6.78)), which we 
anticipate is much larger than the preferred-frame effects shown in Eq. (8.33c) — the goal 
is to set strong upper bounds on such effects. Thus the variations in the orbit elements will 
be modulated on a pericenter precession timescale and could even change sign. So in order 
to find the proper long-term evolution of the elements, we define, for a given element X,, 
dX, /do = AX, /27, insert w = wo + w'¢ in the right-hand sides (including in the P and 
Q components of various vectors) and integrate with respect to ¢. Inserting the resulting 
integrals for e and a, into the expression 


r= all —ecos(¢—w) + O(e’)], (8.34) 


and expanding to first order in the preferred-frame perturbations, we obtain 


1 1/2 
" =1-eycos(¢ wo — wd) aA (=) : (w cos ¢@ — We sind) 
a 4 ar 
@&  »7sinw'd 
a2 


a 


[2Waw_ cos (bd + wo) + (W, — WO) sin (¢ + wWo)] 


w! 


€y9 .Mg sinw’ . 
Wea gas = ? [2ncgang. cos (¢ + wo) + (ng. — nea) sin(d+wo)]. (8.35) 


4 'G 


The first term in Eq. (8.35) is the normal contribution to r/a resulting from the small 
eccentricity, with the pericenter advancing at a rate w’. The second term is a forced 
eccentricity of the orbit, with an amplitude proportional to (w2, + w2,)!/2 = (1-w?)'/? = 
sin, where ~ is the angle between the orbital angular momentum h and the velocity 
w relative to the preferred frame, and a phase given by tan! (—wWe/w.) (Damour and 
Esposito-Farése, 1992b). This effect is present even in the limit eg — 0. The final terms 
are also polarizations of the orbit, proportional to e9, with an amplitude modulated by the 
factor sin w'd/w’. However, they vanish in the limit e9 — 0. 

The other important effect of the preferred-frame perturbations is to cause the orbital 
angular momentum to precess. Since h = sinu(sin Qey — cos Ney) + cos vez, variations 
in h are given by 


Ah = sintAQeg — Are, . (8.36) 
Inserting Eqs. (8.33d) and (8.33e), and noting that eg cosw + e, sinw = ep and that 


A\ eg — Age, =A X h, we obtain 


“ 1/2 ‘ x 
Ah = —FuA (=) Weep — Taw, (Ww X A) + Ine nga (ng xh), (8.37) 
a 'G 
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leading to a precession of the angular momentum vector h. Because of the variation of 
e in Eq. (8.33b), the magnitude of the angular momentum h = ,/ma(1 — e?) will also 
change. The first two terms, dependent on the preferred-frame parameters a, and a2 do not 
come as a surprise. We saw in Section 4.4 that, in semi-conservative theories of gravity, a 
system’s total angular momentum is conserved only if the system is at rest in the preferred 
frame; for moving systems, angular momentum need not be conserved. The third term may 
seem surprising, since € can be non-zero even in a fully conservative theory, where total 
angular momentum is conserved. In this case, because of the distant mass mg, the binary 
system is not a truly isolated system, and presumably there is a compensating change in 
the angular momentum of the orbit of the binary around the distant mass that conserves the 
total angular momentum. Changes in the angular momentum of a binary system induced by 
tidal interactions with a third body are well-known (Kozai-Lidov effects); here the effect 
is induced by a post-Newtonian violation of SEP. 

We turn finally to the self-acceleration term in Eq. (8.27) proportional to a3. We assume 
that only body 1 possesses enough self-gravitational binding energy and spin to generate 
such a term, so that the relative acceleration is given by 


1 Q 
a@=—~03—wxw, (8.38) 
3 > m, 


where we drop the contribution from the orbital velocity v;. Calculating the secular changes 
in the orbit elements, we obtain 


hax 0; (8.39a) 

2 /} — p2 
Ae =—1-——* Bo, (8.39b) 

m 
2 /1 pe 
Ag = 1 Bp, (8.39c) 
em 
Riaage he (8.394) 
(———7 6 WwW, E 

mv1 — e2 " 
in.AQ Be Fs (8.39e) 
sin t = 7 ————B, sinw, 39e 

mv/1 — e2 


where B = a3(Q)/m)(w x w1). As before, we integrate with respect to ¢ including the 
dominant pericenter precession proportional to w’, assume that e is small, and substitute 
the results for e and a into Eq. (8.34) to obtain 


é = 1—e9cos(¢ — wo — 4d) 
a 


1 2-0 

+ ola. [(w x w1)-e, sing + (w x w))-egcos¢)] . (8.40) 

Again we recognize a forced eccentricity, with amplitude proportional to |w x w;|sinw, 

where w is the angle between h and w x wy. If the body’s spin vector w is perpendicular 
to the orbital plane, then 7) = 90°. 
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8.3 Preferred Frames and Locations: Structure of Massive Bodies 


8.3.1 G, and distortions of massive bodies 


In Section 6.6, we found that some metric theories of gravity predict preferred-frame and 
preferred-location effects in the locally measured gravitational constant G, as measured by 
means of a Cavendish experiment: 


G. =1 = [48-1 -3- £8 +x) ME — 5 fer — a3 — a (1 = yw 
— 5a (1 ~ 3x) (9)? + € (1 —3y) "Elng-m), (8.41) 


where w is the velocity of the body relative to the preferred frame, mg, rg and mg are the 
mass, distance, and unit vector for the external body (summed over all external masses, if 
necessary), and 

Y= — : (8.42) 
where m and J are the mass and spherical moment of inertia of the source body, and r and 
n are the distance and unit vector between the two masses being used to measure G,. 

These effects represent violations of SEP. Unfortunately, present-day laboratory mea- 
surements of G are only accurate to parts in 104 (Mohr and Taylor, 2005), and so are 
unlikely to shed light on the post-Newtonian corrections to G_ shown in Eq. (8.41), 
which are at levels of 10~° and smaller in the solar system. However, such variations and 
anisotropies in Gy, should affect the structure of self-gravitating bodies such as the Earth, 
in a manner that is analogous to the tidal effects of the Sun and Moon. 

The tidal forces caused an external body induce variations in the local acceleration of 
gravity g on the surface of a chosen body. The measured variations are affected not only 
by the direct tidal accelerations, but also by the displacement of the surface relative to the 
center of the body and by the redistribution of mass inside the body. One can show that the 
variations in g are then related to the external driving variations by a numerical factor, a 
combination of so-called “Love numbers,” which depend on the detailed structure of the 
body (see section 2.4 of PW for a pedagogical discussion). For the Earth, the numerical 
factor turns out to be approximately 1.16. These solid Earth tides are to be distinguished 
from ocean tides, which are much more complex. 

On the Earth, variations in g are measured routinely and to very high precision by global 
arrays of gravimeters, as a means to learn about the structure of the Earth via measurement 
of the Love numbers. We wish to use such measurements to set limits on the PPN parameters 
that appear in Eq. (8.41). We will assume that, for the Earth, we can approximate 

Ag = eo, (8.43) 
g G. 
A more accurate calculation of Ag/g would consider the PPN equation of hydrostatic 
equilibrium obtained from the static limit of the PPN equation of hydrodynamics (6.25), 
with perturbations induced by the various preferred-frame and preferred-location terms, 
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and then would proceed with the same kind of perturbation calculation that is done for tidal 
perturbations, as detailed in PW, section 2.4. In the Earth’s interior the perturbing force 
generated by the variations in Gy is proportional to p* VU, whereas the tidal perturbing 
force is proportional to the distance from the center of the Earth. If the Earth’s density 
were uniform, then p* VU would be proportional to 7, and the Love numbers would be the 
same as in the Newtonian tidal case. However, in Newtonian tidal theory, the Love number 
factor for gravimeter measurements (1.16), is not very sensitive (5 percent) to variations 
in the model for the Earth, thus we do not expect it to be sensitive to a different disturbing 
force law. 

Consider the first post-Newtonian term in Eq. (8.41). Because of the Earth’s eccentric 
orbital motion, the external potential produced by the Sun varies yearly on Earth by only 
a part in 10!°, too small to be detected with confidence by Earth-bound gravimeters. The 
time-varying effects of other bodies (planets, the galaxy) are even smaller. The second term 
is proportional to w? = w)+2wo-v+v’, where wo ~ 107? is the velocity of the solar-system 
barycenter relative to the preferred frame and y ~ 10~ is the Earth’s orbital velocity. The 
first term is constant and the third varies by only a part in 10!°, but the middle term varies 
with the full amplitude « wov because of the orbital motion of the Earth. Because of 
the rotation of the Earth (varying n) the two anisotropic terms in Eq. (8.41) will generate 
variations in gat various harmonics of the Earth’s rotation rate. In addition, (w-n)? will vary 
because of the orbital contribution to w. Finally, the preferred-location term should include 
all gravitating matter that is not part of the cosmological background used to establish the 
asymptotically flat PPN coordinate system. Therefore, it should include the Sun, planets, 
stars, the Galaxy, and so on, at least out to scales where the matter distribution is sufficiently 
isotropic so as to wash out the variation with changing n. In this case, the factor mg/rg is 
dominated by our galaxy (~5 x 1077) followed by the Sun (~1078). 

In order to illustrate how the variations in Gy, would appear in practice in a time series 
of gravimeter readings, we will select the variations with largest amplitude in Eq. (8.41), 
and write 


AG, ( 1 ) 
= Q{ a3 an Wo:v 


1 lim 
+ 7a2(wo-n)” — 56 - 


r (ng-n), (8.44) 


'G 


where we have used the fact that y ~ 1/2 for the Earth, and where here the subscript 
G refers to the galaxy. We work in geocentric ecliptic coordinates, and assume a circular 
Earth orbit, with the Earth at vernal equinox at ¢ = 0. Then 


v=v(sinwte, — coswte,) , 
Wo = cos By, (cos Ay ey + sinAye,) + sin Bye, , 


Ng = cos Bg (cos Ag ey + sinAge,) + sin Beez. (8.45) 


The latter two equations define the ecliptic coordinates (A,,, 8) and (Ag, 8g) of the solar 
system’s velocity and of the galactic center, respectively. For a gravimeter stationed at 
Earth latitude L, 
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n= cosL cos (Mt — e)e, + [cos L sin (Qt — €) cos@ + sinL sin 6] e, 

— [cos L sin (Ot — €) sin@ — sinL cos 6] e,, (8.46) 
where « is related to the longitude of the gravimeter on Earth, and @ is the “tilt” (23.5°) 
of the Earth relative to the ecliptic plane. Then the three amplitudes in Eq. (8.44) can be 
expressed as 

Wo + ¥ = wovcos G,, sin (wt — Aw), 
1 : : : ‘ 
(wy -n)? = 5 Wo[ COS” S cos’L + 2 sin?6, sin’Z + sin 26,, sin 2L cos (Qt — € — ay) 
+ cosd,, cos*L cos 2(Ot — € — aw) ], 


(ng: n)? = [ cos*dg cos’L + 2sin?dg sin’L + sin 26g sin2L cos (Qt — €— ag) 


Nile 


+ cos*5g cos’ cos 2(Qt — € — ag)| : (8.47) 


where (ay, dy) and (ag, dg) are the Earth-oriented equatorial coordinates. The ecliptic 
and equatorial coordinates are related by 


sind = sinGcos@+ cos @sindsindA, 
cos dcosa@ = cos cos, 


cos d sina = —sinZsin@+ cos (cos é@sin2. (8.48) 


The two anisotropic terms produce variations in Ag/g at frequencies 2Q and Q, 
corresponding to periods of half a sidereal day (“semidiurnal”) and a sidereal day 
(“diurnal”), respectively. The dominant Newtonian tides occur at frequencies 2(Q — w¢ ) 
and 2(Q — we), corresponding to half a lunar and solar day respectively. In fact, because 
of factors such as the tilt of the lunar orbit relative to the ecliptic, there are numerous 
sidebands of the diurnal and semidiurnal tides at twa, +2wg, tw and +2we¢ . Similarly, 
if we had included the orbital contributions to (w - n)? and the solar contribution to 
the preferred-location term, there would be a corresponding array of sidebands in the 
PPN signal. Thus by analyzing gravimeter data taken over long periods of time, and 
incorporating Earth models that predict the Newtonian tidal amplitude to relatively high 
accuracy, one can endeavor to place bounds on a2 and €. We will discuss specific 
experiments and the bounds they provide in Section 8.4. 

The wo - v term produces an annual variation in the size of GL. Such variations would 
cause the Earth to expand and shrink slightly, changing its moment of inertia, and thereby 
changing its rotation rate on an annual basis. In Section 8.4, we will discuss the bounds 
placed by comparing such variations with the observations. 


8.3.2 Precession of the spin axis of massive bodies 


Another effect induced by preferred-frame and preferred-location violations of SEP is an 
apparent failure of conservation of angular momentum, leading to a precession of the axis 
of a rotating massive body. We have already seen an example of this in the precession of the 
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angular momentum of a binary system in Section 8.2, Eq. (8.37). Here we will derive the 
analogous effect for a rotating, self-gravitating body at rest in the PPN coordinate frame. In 
the PPN equation of hydrodynamics (6.25), we split the various potentials U, ®5, ®j, and so 
on into an internal part and a part due to an external body, keeping only terms proportional 
to mg/rg. We also include the preferred-frame terms. Keeping only such terms along with 
the Newtonian Euler equation, we obtain 


dvi m 1 
a = p*U; [ = (484+ 2y-1-G—-4€) + x=(a3 ay) | 
t 'G 2 


1 
[ (2 2. “lp, PA jem E — nk nt = -anww™ 
'G 'G 2 


1 1 
+ 7 203 = a)w Vp; + nol [w/Uo + aviv] = aw X ojx ‘5 (8.49) 


where U, X, and V; are potentials generated by the rotating body, with V; dependent on 
v*, the velocity of the fluid relative to the center of mass. The coefficient of mg/rg in 
the first line of this expression differs slightly from the corresponding term in Eq. (8.41) 
because here we work in PPN coordinates, whereas Gy in Eq. (8.41) was defined using 
proper distances and times. We now wish to calculate the rate of change of the angular 
momentum tensor, dJ’/dt = 2 f p*x!(dv/l/dt)d’x. We use the fact that terms such as 
f p*x" Ua x or f x!ip/ld’x are already known to vanish in establishing the Newtonian 
conservation of angular momentum, and that quantities such as [ p*x'Vid°x, [ p*x'v,.U/d?x 
or | px Ud?x vanish because they contain odd numbers of spatial indices, and we make 
our standard assumption of reflection symmetric bodies. The only terms that survive are 


dj! » er/Ve 
a / p*xlixilin By [26 nt — aww" : (8.50) 
t 'G 


Recalling that ¥ = f p*|x — x’|d°x’, it is straightforward to show that 
/ p*xlixalen By = Qkligim Ae Qlig/k , (8.51) 


where (2 is the self-gravitational energy tensor for the body. It satisfies the virial relation 
2T! + QY + P5Y = 0, where P is the integral of the pressure, and 7” is the kinetic energy 
tensor, given for a uniformly rotating body by 


i 
Jia 5 | evvias - xe ett uf f prxrstd's, (8.52) 


For a nearly spherical body, f p*x"x4d°x ~ 4 641., where J, = f p*r? sin’@d°x is the 
moment of inertia about the rotation axis e, and thus 


Ti= sles? (5% — ee) . (8.53) 


Pulling everything together and recalling that the angular momentum vector is given by 
J‘ = }c/J! we find that the torque on the body is given by 


1 
dd = —=J,u" [ree -ng(e X ng) — are: we x »)| . (8.54) 
dt 2 'G 


186 


Tests of the Strong Equivalence Principle 


This is completely equivalent to the change in orbital angular momentum shown in 
Eq. (8.37). There is no a, term in Eq. (8.54) because of our assumption of reflection 
symmetry for the body; this is equivalent to setting e = 0 (circular orbit) in Eq. (8.37). 


8.4 Preferred Frames and Locations: Bounds 
on the ppn Parameters 


Here we review the bounds that have been placed on the preferred-frame parameters a1, a2 
and a3 and on the preferred-location or Whitehead parameter € by observational searches 
for the effects described in Sections 8.2 and 8.3. 

To be concrete, we must employ values for the velocity of the preferred frame and the 
galactic potential and their associated directions. In any theory of gravity with a preferred 
frame, it is natural to assume that the preferred frame is the rest frame of the overall mass 
distribution of the universe, as represented by the frame in which the cosmic microwave 
background (CMB) is isotropic. From the measured dipole anisotropy of the CMB, we 
know that the solar-system is moving with a speed wo ~ 370km st ~ 12x 1073, 
Table 8.2 shows the precise value along with the direction in galactic, ecliptic, and 
equatorial coordinates. 

The galactic potential mg/rg is less precise, but we can estimate it within a factor of a 
few. The solar system’s velocity within the galaxy is 230 km/s; assuming that v? ~ mg/r¢ 
we can estimate mg/rg ~ 5 x 10~’. This assumes that the mass of the galaxy is more or 
less concentrated at the center, whereas we know that the bulk of the mass of the galaxy 
is in a roughly spherical halo of stars and dark matter, substantially larger in size than the 
visible Milky Way. 

However it can be shown using a simple density model for the galaxy that the rough 
estimate we have adopted holds up within a factor of a few. First, we note that the important 
quantity (mg/rg)n',n/, is a single-mass version of the potential tensor U? = 5/U — X¥. 
For a spherically symmetric distribution of matter, X is given by 


4n [" 4n 
X=rm(r) + = / p'rsdr + = / pl’ (P+ 3r?)dr, (8.55) 
0 r 


where p is the mass density and m(r) is the mass inside radius r. Then, for spherical 
symmetry, 


7 aX 1,., oo gh 
ai 5 pial 
Xx nn! 5 = n'n!) 


Lgl E ao] | 6f Ke la es i 7 pdr (8.56) 


r r 3r3 3 


where /(r) is the spherical moment of inertia inside radius r. Since we are only interested 
in anisotropic effects, we will drop all terms proportional to 6”. 

To compare with our rough estimate we consider a specific density distribution given by 
A4np = a/r-, forr < r,, and 4p = a/?’, for r > r¢, where r; is a core radius meant to 
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Table 8.2 Parameters of the preferred frame velocity #9 and the galactic 


potential at the solar system. 


Galactic Ecliptic Equatorial 
(, b) (A, 8) (a, 6) 

Velocity, 368 + 2km s7! (1.23 x 107%) 
Longitude 263° .85 AE 55 167°.85 
Latitude 48° .25 —11°.13 —6°.88 
Galactic potential ~ 5 x 1077 
Longitude 0° 266° .84 266°.41 
Latitude 0° —5°.54 —28°.94 


represent the mass of the inner part of the galaxy, and a is a parameter. The 1 /7* density 
distribution is meant to model the dark matter halo, and to yield a flat rotation curve for 
the outer reaches of the Milky Way, in rough agreement with observations. By noting that 
a circular orbit in a spherical potential satisfies, v?/r = a, = m(r)/r’, and considering the 
case r > re, we can fit a = v*/(1 — 2q/3), where q = r./r, and find that 


2 l-q+¢@/5 
UL = LV nin! 8.57 
ey Tonnes een 
For the case r < r,, a similar calculation gives 
vii 22 i. 
Ga 5” NGNG » (8.58) 


independent of g. Thus for v ~ 230 km/s, we find an amplitude 2 — 4 x 107’. Note 
from Eq. (8.56) that only the matter inside our radius has an effect on the anisotropy. Even 
though the galaxy and its halo are not strictly spherically symmetric, this is unlikely to 
alter the estimate significantly. Table 8.2 shows our adopted value for the galactic potential 
along with the directions to the galactic center in various coordinates. 

The anisotropies in Gy in Eq. (8.44) provided the first bounds on PPN preferred-frame 
and preferred-location parameters. Surveying the geophysical literature of the time, Will 
(1971a) pointed out that data on the semidurnal tides of the Sun and Moon agreed with 
Newtonian predictions to the level corresponding to amplitudes of 10~°, resulting in the 
bounds |a2| < 3 x 107? and || < 5 x 1073. An explicit experimental test was performed 
by Warburton and Goodkind (1976), who used an array of superconducting gravimeters 
at a site in southern California to take an 18-month record of tidal amplitudes and phases. 
They included the full array of PPN variations, not just the ones shown in Eq. (8.44), and 
carefully accounted for the disturbing effects of tides of the nearby Pacific Ocean. The 
result was the pair of bounds |a2| < 4 x 1074 and |€| < 1073. Nordtvedt and Will (1972) 
placed modest bounds on various combinations of the parameters using perhelion advance 
information and limits on any anomalous annual variation in the rotation rate of the Earth, 
the latter induced by the wo - v term in Eq. (8.44). 
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A tighter bound on a2 was set by Nordtvedt (1987), by considering the spin precession 
effect of Eq. (8.54) acting on the Sun. Focusing on the preferred-frame term, the precession 
takes the form dJ/dt = Q x J, where J = I,we and Q = —Fa2W(e -w)w, where w is the 
rotation angular velocity of the Sun. Over a time 7, the change in direction of the spin axis 


would be of order 


A T 
“ ~ <onw? sin B,, cos ,, . (8.59) 


Inserting w ~ 2.8 x 10~°s~!, T = 4x 10° years, and requiring that the change be no larger 
than the known 7° tilt of the Sun’s axis relative to the ecliptic, we obtain |a2| <2 x 107°. 

Lunar laser ranging yielded the bounds a; = (—0.7 + 1.8) x 1074 and aj = (1.8 + 
5.0) x 10~> from the absence of any induced eccentricity (Eq. (8.35)) in the lunar orbit 
(Miiller et al., 1996, 2008). 

Substantial improvements in bounds on these parameters came from analyzing timing 
data from a variety of isolated and binary pulsars. 

In TEGP, we pointed out that, for an isolated spinning body moving relative to the 
preferred frame, the term proportional to a3 in Eq. (8.27) produces a self-acceleration 
(recall that a3 is also a conservation law parameter). If the body were a pulsar, this would 
induce a change in the observed pulse period P, at a rate dP,/dt ~ 2 x 10~4a3, for 
Q/m ~ 0.1 and w ~ 300 km/s. Given that half of the observed pulsars have dP, /dt lying 
between 10~'4 and 107!5, we inferred the upper bound |a3| < 107!°. This bound was 
improved dramatically using data on ~ 21 WBMSPs. Searches for the induced eccentricity 
of Eq. (8.40), combined with statistical arguments to mitigate the effects of unknown 
orbital orientations of the observed population of WBMSPs led to the impressive bound 
|a3| < 4 x 10~7° (Bell and Damour, 1996; Stairs et al., 2005). 

Similar searches for eccentricities induced by the a, term in Eq. (8.35) resulted in 
bounds on a, as small as a few parts in 10° (Wex, 2000; Shao and Wex, 2012). The 
tightest bound used a specific binary pulsar J1738+0333, whose orbit around its white- 
dwarf companion has an eccentricity 3.4 x 10~’. The analysis was helped by the fact 
that the white dwarf is bright enough to be observed spectroscopically, leading to accurate 
determinations of the key orbital parameters. Furthermore, because the pericenter advances 
at a rate of about 1.6deg yr!, the decade-long data span made it possible to partially 
separate any induced eccentricity, whose direction is fixed by the direction of w, from 
the natural eccentricity, which rotates with the pericenter. For this system the result was 
|ai| < 3.4 x 107° (Shao and Wex, 2012). 

Limits on a2 and € were obtained by looking for the precession of the orbital plane of a 
binary system [see Eq. (8.37)]. Such a precession would lead to a variation in the “projected 
semimajor axis” of the pulsar, a, sinz, a quantity that is measured very accurately in 
binary pulsar timing (see Section 12.1). Combining data from the two WBMSP systems 
J1738+0333 and J1012+5307, Shao and Wex (2012, 2013) obtained the bounds |a2| < 
1.8 x 1074 and |é| < 3.1 x 1074. 

These bounds were improved substantially by searching for the precession of the spin of 
isolated pulsars (see Eq. (8.54)), the analogue of the precession effect studied by Nordtvedt 
(1987) in the solar context. If the spin of the pulsar precesses, then the pulse profile should 
change as the line of site to the pulsar slices different parts of the emission region. Using 15 
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2 Neutron stars and ppn parameters 


In this chapter, we have used a number of observations of binary and isolated pulsars to place bounds on 
preferred-frame and preferred-location parameters. And observations of the pulsar in a triple system may 
soon place a bound on the Nordtvedt parameter. 

However, the ppn framework does not strictly apply to such systems, because the neutron stars are compact, 
strong-field objects, in the sense that the post-Newtonian expansion parameter € can be of order 0.1 — 0.3 
near and inside these objects. And in theories of gravity with spontaneous scalarization effects, the effective 
internal values of € can be much larger than unity. 

Accordingly, one should treat bounds on the ppn parameters inferred from such systems with some caution. 
In some alternative theories, the effect of the compact body could be to rescale the relevant Pp parameter by 
a factor 1 + €, where e ~ m/r, where mand rare the mass and radius of the neutron star. Since the bounds 
on many of these parameters are very small, such corrections of order tens of percent will be insignificant. 
On the other hand, in some theories, additional parameters may be present for systems containing compact 
objects, and the relation between these paramaters and the true PPN parameters may be more complicated 
than a simple rescaling of the pn parameter. This will be addressed in Section 10.3.1, where we will introduce 
a “modified EIH formalism,” a parametrization of the Lagrangian for systems of compact bodies, rather than 
of the spacetime post-Newtonian metric. These parameters will depend on the theory of gravity and on the 
internal structure of the compact bodies in the system. In the limit where there are no compact bodies, these 
parameters will have specific limits in terms of the PpN parameters for semiconservative theories. For example, 
using the modified EIH formalism, we will derive an equation forthe preferred-frame perturbations of a binary 
system of compact bodies that generalizes the preferred-frame terms in Eq. (8.28). 

Inthe literature on tests of SEP using neutron stars, it is conventional to designate the relevant parameters 
using “hats,’ for example a1, 2, or é, to emphasize that the parameters may not be quite the same as the 
corresponding PPN parameters. 


years of observations of B1937+21 and J1744-1134, two isolated millisecond pulsars with 
very stable pulse profiles, Shao et al. (2013) and Shao and Wex (2013) found no evidence 
of pulse profile changes, and thereby established the strong bounds |az| < 1.6 x 107° 
and |€| < 3.9 x 107°. The use of isolated pulsars rather than binary pulsars was crucial, 
in order to avoid any complication from the well-known geodetic precession of spins that 
occurs in binary pulsar systems (see Section 12.1.2). 


8.5 Constancy of Newton’s Gravitational Constant 
SS EEE 


Most theories of gravity that violate SEP predict that the locally measured Newtonian 
gravitational constant may vary with time as the universe evolves. For the theories 
discussed in Chapter 5, the predictions for G/G can generally be written in terms of time 
derivatives of the asymptotic dynamical fields or of the asymptotic matching parameters. 
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Other, more heuristic proposals for a changing gravitational constant, such as those due to 
Dirac, cannot be written this way. Dyson (1972) gives a detailed discussion of these earlier 
proposals. Where G does change with cosmic evolution, its rate of variation should be of 
the order of the expansion rate of the universe, that is 

G 

—~oh, 8.60 

G 0 (8.60) 


where Ho is the Hubble expansion parameter whose value is 


Hy =70+2km s~'Mpe7! ~7 x 107! yr7!, (8.61) 


(Bennett et al., 2013; Planck Collaboration et al., 2014; Grieb et al., 2017), and o is a 
dimensionless parameter whose value depends upon the theory of gravity under study and 
upon the detailed cosmological model assumed. 

For general relativity, of course, G is precisely constant (o = 0). For Brans-Dicke theory 
in a spatially flat Friedmann-Robertson-Walker (FRW) cosmology, o ~ —(2 + w)~! [see, 
e.g., Section 16.4 of Weinberg (1972)]. In Einstein-4ther and Khronometric theory, G is 
constant, basically because the auxiliary vector field is constrained to have unit norm. 

However, several observational constraints can be placed on G/ G, one kind coming 
from bounding the present rate of variation, another from bounding a difference between 
the present value and a past value. The first type of bound typically comes from LLR 
measurements, planetary radar-ranging measurements, and pulsar timing data. The second 
type comes from studies of the evolution of the Sun, stars and the Earth, Big-Bang 
nucleosynthesis, and analyses of ancient eclipse data. Given the present uncertainties in 
laboratory measurements of G (together with significant disagreements among measure- 
ments that are larger than the individual errors), it is unlikely that laboratory tests of G will 
contribute interesting bounds for the foreseeable future. For a time during the 1970s, van 
Flandern (1975) and others argued for a nonzero G, based on observations of the recession 
of the Moon from the Earth. Today those claims are given little credence, particularly in 
the face of much more precise, independent bounds on G /G. 

The best limits on a current G/G come from improvements in the ephemeris of Mars 
using range and Doppler data from the Mars Global Surveyor (1998 — 2006), Mars Odyssey 
(2002 — 2008), and Mars Reconnaissance Orbiter (2006 — 2008), together with improved 
data and modeling of the effects of the asteroid belt (Pitjeva, 2005; Konopliv et al., 2011; 
Pitjeva and Pitjev, 2013). Since the bound is actually on variations of GMa, any future 
improvements in G/ G beyond a part in 10" per year will have to take into account 
models of the actual mass loss from the Sun, due to radiation of photons and neutrinos 
(~ 0.7x 10—!3 yr—!) and due to the solar wind (~ 0.6 x 10—}3 yr~!). Another bound comes 
from LLR measurements (Williams et al., 2004b). For earlier results see Dickey et al. 
(1994), Williams et al. (1996) and Miiller et al. (1999). This precision could be improved 
by the future BepiColombo mission to Mercury, scheduled for launch in late 2018 (Milani 
et al., 2002; Ashby et al., 2007). 

Although bounds on G/G from solar-system measurements can be correctly obtained in 
a phenomenological manner through the simple expedient of replacing G by Go + Gli to) 
in Newton’s equations of motion, the same does not hold true for pulsar and binary pulsar 
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Table 8.3. Tests of the constancy of G. 


G/G 
Method (Omer) Reference 
Mars ephemeris 0.1+41.6  Konopliv et al. (2011) 
—0.6+0.4  Pitjeva and Pitjev (2013) 

Lunar laser ranging 4+9 Williams et al. (2004b) 

0.7+3.8 Hofmann et al. (2010) 
Binary and millisecond pulsars —7 +33 Deller et al. (2008) 

Lazaridis et al. (2009) 

Helioseismology Ose Me Guenther et al. (1998) 
Big-bang nucleosynthesis 0O+4 Copi et al. (2004) 


Bambi et al. (2005) 


timing measurements. The reason is that, in theories of gravity that violate SEP, such as 
scalar-tensor theories, the “mass” and moment of inertia of a gravitationally bound body 
may vary with G. Because neutron stars are highly relativistic, the fractional variation in 
these quantities can be comparable to G/ G, the precise variation depending both on the 
equation of state of neutron star matter and on the theory of gravity in the strong-field 
regime. The variation in the moment of inertia affects the spin rate of the pulsar, while 
the variation in the mass can affect the orbital period of a binary in a manner that can 
subtract from the direct effect of a variation in G, given by P, {P= 9G /G (Nordtvedt, 
1990). Thus, the bounds quoted in Table 8.3 for binary and millisecond pulsars are theory- 
dependent and must be treated as merely suggestive. 

Bounds from helioseismology and Big-Bang nucleosynthesis (BBN) assume a model 
for the evolution of G over the multi-billion-year time spans involved. For example, the 
concordance of predictions for light elements produced around 3 minutes after the Big 
Bang with the abundances observed in the present universe indicate that G then was within 
20 percent of G today. Assuming a power-law variation of G ~ t~® then yields the bound 
on G/G today shown in Table 8.3. 


Other Tests of Post-Newtonian Gravity 


A number of tests of post-Newtonian gravitational effects do not fit into either of the two 
categories, classical tests or tests of SEP. These include the search for relativistic effects 
involving spinning bodies (Section 9.1), the de Sitter precession (Section 9.2), and tests 
of post-Newtonian conservation laws (Section 9.3). Some of these experiments provide 
limits on PPN parameters, in particular the conservation-law parameters ¢), C2, ¢3, and Ca, 
that were not constrained (or that were constrained only indirectly) by the classical tests 
and by tests of SEP. Such experiments provide new information about the nature of post- 
Newtonian gravity. Others, such as measurements of spin effects, constrain values for PPN 
parameters that are already better constrained by the experiments discussed in Chapters 7 
and 8. Nevertheless, such experiments are important, for the following reasons: 

(1) They may provide independent checks of the values of the PPN parameters, and 
thereby independent tests of gravitation theory. They are independent in the sense that 
the physical mechanism responsible for the effect being measured may be rather different 
than the mechanism that led to the prior limit on the PPN parameters. An example is the 
Lense-Thirring effect, the dragging of inertial frames produced by the rotation of a body. 
It is not a preferred-frame effect, yet it depends upon the parameter a. So while a bound 
on a; achieved by measuring the Lense-Thirring effect may not beat the strong bounds we 
saw in Chapter 8, it comes from rather different physical effects. 

(ii) The structure of the PPN formalism is an assumption about the nature of gravity, 
one that, while seemingly compelling, could be incorrect or incomplete. According to 
this viewpoint, one should not prejudice the design, performance, and interpretation of an 
experiment by viewing it within any single theoretical framework. Thus, the 7 parameters 
measured by light-deflection and time-delay experiments could in principle be different, 
while according to the PPN formalism they must be identical. We agree with this viewpoint 
because, although theoretical frameworks such as the PPN formalism have proven to be 
very powerful tools for analyzing both theory and experiment, they should not be used in a 
prejudicial way to reduce the importance of experiments that have independent, compelling 
justifications for their performance. 

(iii) Any result in disagreement with general relativity would be of extreme interest. 


9.1 Testing the Effects of Spin 


Post-Newtonian gravity introduces an entirely new class of phenomena that are not present 
in Newtonian gravity — the effects of spin or rotation. These effects occur because, at 
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post-Newtonian order, the velocity of matter becomes important, and consequently the 
internal velocity of a rotating body, even if it is at rest, can affect the gravitational field 
it generates, and can influence how it responds to an external field. These effects, known 
in various contexts as spin-orbit and spin-spin coupling, frame-dragging, Lense-Thirring 
effects, gravitomagnetism, and so on, are fundamentally relativistic. This is also true in 
quantum mechanics, where spin effects arise only when special relativity is merged with 
quantum theory to produce the Dirac equation. In post-Newtonian theory, the term “spin” 
refers only to the rotation of a finite-size, massive body. It does not refer to quantum 
mechanical spin, although there are numerous parallels between effects in the two realms. 

There are precessions of spins in Newtonian gravity, but the nature of the effects is 
completely different. For example, the “precession of the equinoxes” is caused by the 
precession of the Earth’s rotation axis with a period of about 26,000 years. However this 
is the result of the interaction between the Earth’s quadrupole moment (which is caused 
by the centrifugal forces associated with its rotation) and the gravitational fields of the 
Sun and Moon. It is a nonrelativistic, quadrupole interaction involving nonspherical mass 
distributions. By contrast, the relativistic effects of spin are dipole in nature. 


9.1.1 Effects of spin in two-body systems 


In Section 6.7 we derived the post-Newtonian effects of the spin of a body on its motion, 
and the effects of its motion in a gravitational field on its spin. We found that a spinning 
body experiences spin-orbit and spin-spin accelerations [Eq. (6.113)] produced by both 
its own spin and the spins of other bodies in the system. At the same time, its own spin 
experiences a precession, caused by spin-orbit, spin-spin and preferred-frame couplings. 
The key equations, (6.113), and (6.119)-(6.121), were derived for a general N-body system 
of spinning masses. Here we specialize to two bodies and derive results that will allow us to 
discuss several important tests of the effects of spin, including Gravity Probe B, LAGEOS 
tracking, and precessions in binary pulsars. 

We consider a binary system of bodies with masses m; and mz and spins §, and $5, in 
an elliptical orbit with elements p, e, 1, 2, and w. We will first work out the precession of 
the spin of body 1, averaged over an orbit. Using Eqs. (6.119) and (6.121) restricted to two 


bodies, along with the fact that vy) = (m2 /m)yv and v2 = —(m,/m)y, we obtain 
dS; 1 m,\ m2 
1 1 
+ 8 (4y +44 a1) 3 [3n(n- Sr) — S>| x S1, (9.1) 


where r = |x12|, 2 = xX12/r, and h = x x vy. By suitably interchanging labels 1 = 2, we 
can write down the corresponding equation for spin 2. Using our Keplerian orbit formulae 
of Eq. (6.73), and setting both spins to be constant on the right-hand-side, we can integrate 
over one orbit to find the change AS,. We make use of the fact that, when integrated over 


one period, 
1 20 nini I fade aR 
dt = : dt = 6’ —h'h’), 2 
E hp / P a ) 2) 
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where h = \/mp and p = a(1 — e”). The change in S| over one orbit is then given by 
AS, =n (2741+) 8G 5c 8 
ms Pp 
T 

Shp 
The first term is frequently called the geodetic precession; it is a spin-orbit effect, as it 
involves the interaction between the spin S; and the orbital angular momentum h. For the 
geodetic effect, the change in S$ is always confined to the orbital plane (orthogonal to h); 
it vanishes if the spin is parallel to A and is maximized if the spin lies in the orbital plane. 
Geometrically, the effect is partially linked to the fact that the parallel transport of a vector 
around a closed path in a curved spacetime causes a change in the direction of the vector 
related to the spatial curvature and the area enclosed by the path; the “2” part of the 
coefficient is related to this spatial curvature effect. The remaining piece of the coefficient 
is related to the special relativistic Thomas precession of a spin moving in a curved orbit. 

The second term, a spin-spin interaction, is referred to in many ways: Lense-Thirring 
precession, dragging of inertial frames and “Schiff” precession. If the companion’s spin is 
fixed, S| undergoes a precession about a fixed vector that is a combination of the spin of 
the companion and the orbital angular momentum. 

To find the spin-orbit and spin-spin contributions to the relative equation of motion for 
a binary system, a = a; — a, we take Eq. (6.113), set w = 0 and a3 = 0, and obtain 


a (47 +4 + a1) [$2 — 3h(h- S2)| x S$). (9.3) 


a(SO] = sa {mn xv): J(or+ 1+A)o+ (47 +4401) | 


+(n-v)n x J(or+ 1-A)o+ (47 +4401) | 
— 50x [ert De +5 +4 +ai)s||, (9.4a) 


a[SS] = 


co] We 


(47 +4 +1) . [5a(n Si) Sy) = 08: Sy — 28g S»)| ,  (9.4b) 


where 


Qq 
Ill 
| 
A) 
oe 
E 


a (9.5) 


Calculating the radial, cross-track and out-of-plane components of these perturbing 
accelerations, substituting into Eq. (6.74), and integrating over one orbit assuming that 
the spins are constant over an orbital timescale, we obtain the secular changes in the orbit 
elements. For the spin-orbit terms we find 


Aa =0, 
Ae=0, 


Ai = = (Cpcosw — Cg sinw) = pen 
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T 


sintAQ = = (Cp sinw + Cg cosw) = ipo? 
hea" e,. (9.6) 
hp 
where 
C= Oy tle + 5(4yt4t.an)S, (9.7) 


and the subscripts P, Q and h denote components along the pericenter direction, the 
direction in the orbital plane given by eg = h x ep, and the direction # normal to the 
orbital plane, respectively, while the subscripts 2. and | denote components along the line 
of nodes and along the direction given by e, = h x eg. Note that the arbitrary parameter 
A, related to the choice of world line of the spinning body (Box 6.3) drops out. 

Although the inclination of the orbit relative to the reference system changes, the 
inclination relative to the vector C does not. The rate of change of h is given by Ah = 
AQ sinteg — Ave, = (1/2hp)(CLeg — Cae), which is perpendicular to C. Thus the 
orbital angular momentum vector precesses around C with h - C = constant. 

For the spin-spin contributions we obtain 


Ae=0, 
Ae =0, 
ia -— (4y +4 + a1) [(S1)n(S2)a + (S2)n(Sia] , 
nA — (47 +44 an) [(S:)n(S2)a + (Sr)n(Si)a] 
oe a (dy 4440) Sy 36s (9.8) 


9.1.2 Tests of spin precession 


In 2011 the Relativity Gyroscope Experiment (Gravity Probe B or GPB) carried out 
by Stanford University, NASA, and Lockheed Martin Corporation, finally brought to a 
conclusion a space mission to detect the frame-dragging and geodetic precessions of an 
array of gyroscopes orbiting the Earth (Everitt et al., 2011; Everitt et al., 2015). The story 
of GPB is told in Box 9.1. 

To analyze the experiment, we use Eq. (9.3), with the mass of the gyroscope m, set equal 
to zero; m2 and S> are the mass and spin of the Earth, and p is the orbital semilatus rectum. 
To maximize the geodetic precession, it is useful to orient the spin of the gyroscope to be 
perpendicular to h, in other words to lie in the orbital plane. Then the spin will precess 
within the orbital plane, in the same sense as the orbital motion. To maximize the frame- 
dragging precession, one should maximize |Sy — 3h(h - S2)| = (S3 +3(h-S2)*)'/?, which 
means putting the orbit on the equatorial plane so that h and S) are parallel, and orienting 
the spin to be in the orbital plane so that S; is orthogonal to both h and S>. However the 
resulting precession of the spin will also be in the orbital plane, adding to or subtracting 
from the geodetic term. This makes it impossible to separate the two effects. An alternative 
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Gravity Probe B 


Gravity Probe B will very likely go down in the history of science as one of the most ambitious, difficult, 
expensive, and controversial relativity experiments ever performed. It was almost 50 years from inception 
to completion and US$ 750 million in the making, although only about half of the total time was spent as a 
full-fledged, approved space flight program. 

The origin of GPB dates back to around 1960, when Leonard Schiff at Stanford calculated the geodetic and 
Lense-Thirring precessions of a gyroscope and suggested the possibility of measuring them. Independently 
of Schiff, George Pugh at the Weapons Systems Evaluation Group of the US military had performed the same 
calculations a few months earlier. Pugh was assessing the use of high-performance gyroscopes in missile and 
aircraft guidance. He wondered how large the relativistic effects would be, and what it would take for a space 
experiment to measure them. Pugh’s classified work could not be published in the open literature, and so 
Schiff was initially given credit for the idea; only later was Pugh’s work discovered and recognized. It is worth 
noting that all this occurred in the very early days of space exploration, only two years after the launch by the 
Soviet Union of Sputnik, the world’s first artificial satellite. 

GPB started officially in late 1963 when NASA funded the initial R&D work at Stanford to identify the new 
technologies needed to make such a difficult measurement possible. Francis Everitt became Principal Investi- 
gator of GPB in 1981, and the project moved to the mission design phase in 1984. At that time, plans called for 
a preliminary flight on board the Space Shuttle to test key technologies to be used in GPB (similar in intent to 
the LISA Pathfinder mission of 2015), followed by full launch from the Shuttle a few years later. Unfortunately 
the 1986 Space Shuttle Challenger catastrophe forced a cancellation of the technology test, and a complete 
redesign of the spacecraft for a launch from a Delta rocket. The additional costs and delays arising from this 
engendered increasing criticism of the project, and NASA asked the National Academy of Sciences to conduct a 
thorough review of the project in 1994. Following the endorsement of that committee, GPB was approved for 
flight development, and began to collaborate with Lockheed-Martin and NASA‘s Marshall Space Flight Center. 
The satellite finally was launched on April 20, 2004 for a 16-month mission, but another five years of data 
analysis were needed to tease out the effects of relativity from a background of other disturbances of the gyros. 

There were four gyroscopes aboard GPB. Each gyroscope was a fused silica rotor, about four centimeters 
in diameter, machined to be spherical and homogeneous to better than a part per million, and coated with a 
thin film of niobium. The gyroscope assembly, which sat in a dewar of 2440 liters of liquid helium, was held at 
1.8 degrees Kelvin. At this temperature, niobium is a superconductor, and the supercurrents in the niobium 
of each spinning rotor produce a “London” magnetic moment parallel to its spin axis. Extremely sensitive 
magnetometers (superconducting quantum interference detectors, or “SQUIDs”) attached to the wall of the 
spherical chamber housing each rotor were capable of detecting minute changes in the orientation of the 
rotor’s magnetic moment and hence the precession of its rotation axis. 

After the spacecraft reached its orbit, the four gyros were aligned to spin along the symmetry axis of the 
spacecraft. This axis was also the optical axis of a telescope directly mounted on the end of the structure 
housing the rotors. Spacecraft thrusters oriented the telescope to point precisely toward IM Pegasi (except 
when the Earth intervened, once per orbit). In order to average out numerous unwanted nonrelativistic 
torques on the gyros, the spacecraft rotated about its axis once every 78 seconds. The satellite was placed 
in an almost perfectly circular polar orbit with an altitude of 642 kilometers above the Earth’s surface. 
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Almost every aspect of the spacecraft, its subsystems, and the science instrumentation performed 
extremely well, some far better than expected. Still, the success of such a complex and delicate experiment 
boils down to figuring out the sources of error. In particular, having an accurate calibration of the electronic 
readout from the SQUID magnetometers with respect to the tilt of the gyros was essential. The plan for 
calibrating the SQUIDs was to exploit the aberration of starlight, which causes a precisely calculable misalign- 
ment between the rotors and the telescope as the latter shifts its pointing toward the guide star by up to 20 
arcseconds to compensate for the orbital motion of the spacecraft and of the Earth. However, three important, 
but unexpected, phenomena were discovered during the experiment that affected the accuracy of the results. 

First, because each rotor is not exactly spherical, its principal axis rotates around its spin axis with a period 
of several hours, with a fixed angle between the two axes. This is the familiar “polhode” motion of a spinning 
top. In fact this polhoding was essential in the calibration process because it led to modulations of the SQUID 
output via the residual trapped magnetic flux on each rotor (about 1 percent of the London moment). The 
polhode period and angle of each rotor were expected to remain constant throughout the mission, leading 
to accurate calibrations, but instead these quantities actually decreased monotonically with time, implying 
the presence of some damping mechanism, and this significantly complicated the calibration analysis. In 
addition, each rotor was found to make occasional, seemingly random “jumps” in its orientation—some as 
large as 100 milliarcseconds. Some rotors displayed more frequent jumps than others. Without being able to 
continuously monitor the rotors’ orientation, the GPB team could not fully exploit the calibrating effect of the 
stellar aberration in their analysis. Finally, during a planned 40-day, end-of-mission calibration phase, the 
team discovered that when the spacecraft was deliberately pointed away from the guide star by a large angle, 
the misalignment induced much larger torques on the rotors than expected. From this, they inferred that even 
the very small misalignments that occurred during the science phase of the mission had induced torques that 
were probably several hundred times larger than the designers had estimated. 

What ensued during the data analysis phase following the mission was worthy of a detective novel. The 
critical clue came from the calibration tests. Here, they took advantage of the residual trapped magnetic flux on 
the gyroscope. Superconducting lead shielding was used to suppress stray magnetic fields before the niobium 
coated gyroscopes were cooled, but no shielding is ever perfect. This flux adds a periodic modulation to the 
SQUID output, which the team used to plot the phase and polhode angle of each rotor throughout the mission. 
This helped them to discover that interactions between random patches of electrostatic potential fixed to 
the surface of each rotor, and similar patches on the inner surface of its spherical housing, were causing the 
extraneous torques. In principle, the rolling spacecraft should have suppressed these effects via averaging, but 
they were larger than expected. 

Fortunately, the patches are fixed on the various surfaces, and so it was possible to build a parametrized 
model of the patches on both surfaces using multipole expansions, and to calculate the torques induced by 
those interactions when the spin and spacecraft axes are misaligned, as a function of the parameters. One 
prediction of the model was that the induced torque should be perpendicular to the plane formed by the two 
axes, and this was clearly seen in the data. Another prediction was that, when the slowly decreasing polhode 
period crosses an integer multiple of the spacecraft roll period, the torques fail to average over the roll period, 
whereupon the spin axis precesses about its initial direction in an opening Cornu spiral, then migrates to a new 
direction along a closing Cornu spiral. This is known as a loxodromic path, familiar to navigators as a path of 
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fixed bearing on the Earth’s surface. Detailed observation of the orientation of the rotors during such “resonant 
jumps” showed just such loxodromic behavior. In the end, every jump of every rotor could be identified by its 
“mode number,’ the integer relating its polhode period to the spacecraft roll period. 

The original goal of GPB was to measure the frame-dragging precession with an accuracy of one percent, 
but the problems discovered over the course of the mission dashed the initial optimism that this would be 
possible. Although the GPB team were able to model the effects of the patches, they had to pay the price of the 
increase in error that comes from using a model with so many parameters. The experiment uncertainty quoted 
in the final result—roughly 20 percent for frame dragging—is almost totally dominated by those errors 
(Table 9.1). Nevertheless, after the model was applied to each rotor, all four gyros showed consistent Lense- 
Thirring precessions. Gyro 2 was particularly “unlucky”—it had the largest uncertainties because it suffered 
the most resonant jumps. Numerous cross-checks were carried out, including estimating the relativity effect 
during different segments of the 12-month science phase (various events, including computer reboots and a 
massive solar storm in January 2005, caused brief interruptions in data taking), increasing and decreasing the 
number of parameters in the torque model, and so on. 

When GPB was first conceived in the early 1960s, tests of general relativity were few and far between, and 
most were of limited precision. But during the ensuing decades, there was enormous progress in experimental 
gravity in the solar system and in binary pulsars. By the middle 1970s, some argued that the PpN parameters y 
and cv, were already known to better accuracy than GPB could ever achieve. Given its projected high cost, 
critics argued for the cancellation of the GPB mission. The counterargument was that all such assertions 
involved theoretical assumptions about the class of theories encompassed by the PPN approach, and that 
all existing bounds on the post-Newtonian parameters involved phenomena entirely different from the 
precession of a gyroscope. All these issues were debated, for example, in the 1994 National Academy review 
of GPB that recommended its continuation. 

Disclosure: The author was a member of the 1994 NAS panel that recommended completion of GPB, and served 
as Chair of an external NASA Science Advisory Committee for Gravity Probe B from 1998 to 2011. 


strategy, and the one adopted by GPB, is to put the gyros into a polar orbit, so that h-S> = 0. 
This reduces the amplitude by a half, but with $; lying in the orbital plane to maximize 
the geodetic effect, the frame-dragging precession will now be perpendicular to the orbital 
plane. By separately measuring the north-south (geodetic) precession and the east-west 
(frame-dragging) precession, one can make a clean measurement of both effects, at least in 
principle. Dividing Eq. (9.3) by the orbital period P = 27(a*/m)'/?, we obtain the rate of 
angular precession in the two directions given by 
dS 
ae = Diceojens + Qfrpieew » (9.9) 
where 
3/2 
ao Me 2)-1 
Q(Geo} 2 (27 1) 5/2 a) e ) 
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(4y+4+ a1) 39.2mas yr, (9.10) 


where a and e are the semimajor axis and eccentricity of the orbit, and y is the angle 
between the spin axes of the gyroscope and of the Earth. In obtaining the numerical values 
for GPB, we used the fact that the satellite was in a circular orbit at 642 km altitude 
(a = 7013 km). The angular momentum of the Earth is known from combining data from 
the luni-solar precession of the Earth’s axis and from the multipole moments of the Earth, 
and is given by Jo = 0.3307m@R2,we, where Re = 6371 km is the Earth’s mean radius, 
and we = 7.2921 x 10~>s7! is the Earth’s rotational angular velocity. In a perfect world, 
the angle ~ would be chosen to be 90°, but in fact it was about 73°. This is because 
the precessions of the gyroscopes had to be compared with directions in inertial space, as 
established to high precision, for example, by the distant quasars. To make that comparison, 
an on-board telescope was oriented toward a “guide” star, and the spin directions were 
aligned to be parallel to the optical axis of the telescope. This alignment was operationally 
necessary because the spacecraft itself rotated about that same axis in order to average out 
numerous environmentally induced torques on the gyros. Many candidate guide stars were 
considered, but in the end the star IM Pegasi (HR 8703) was selected, in part because it 
was optically bright and relatively isolated in the sky, and in part because it was bright in 
the radio, so that its own proper motion relative to the quasars could be measured using 
VLBI to an accuracy better than the goal for measuring the gyro precessions. IM Peg lies 
at a distance of 96 pc, and at a declination of 16.83°, thus w = 73.2°. In fact, VLBI 
measurements of IM Peg made before, during and after the GPB mission gave a proper 
motion 34.3 + 0.1 mas yr~!, well within the error requirements for GPB (Bartel et al., 
2015). 

Two additional relativistic geodetic precessions had to be accounted for, one induced 
by the quadrupole moment of the Earth (~ 7mas yr~!), and the other induced by the 
gyroscopes’ orbital motion around the Sun (~ 20mas_yr~'). The final results for the GPB 
experiment are shown in Table 9.1; the GPB saga is told in Box 9.1. The complete technical 
details of GPB can be found in a special issue of Classical and Quantum Gravity, Vol. 32, 
2015. 

Another example of relativistic spin precession is in binary pulsars. In this case, the 
masses of the two bodies are comparable, and using the fact that each mass is given by 
m; = m(1 + \/1 — 47) /2, we can express the two precession rates as 


Table 9.1 Final results of Gravity Probe B. 


Measured Predicted 


Geodetic Precession(mas) 6602 + 18 6606 
Frame-dragging (mas) io ae Te OY 
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2 ifm mee 1 day 7 a 
deo] = 5.8 x 107 = Ai 
[Geo] = 5.8 x 10-“ deg yr (+) ( B, ) ine (9. 11a) 
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where v, is the spin frequency of the spinning companion, P, is the orbital period, 
B=h/ mR3, is the moment-of-inertia factor, and 


12/7 : m>m 


6 2 
a=$(1+5n+vi-m) = 1 > m2 =m, (9.12) 


167/7 : m<m, 


For simplicity we have set 7 = | and a; = 0. 

It is clear that the frame dragging precession is negligible in binary pulsar systems, 
whereas the geodetic effect could be measurable. Indeed, the first binary pulsar B1913+16 
provided the first evidence of this effect (see Section 12.1 for details about binary pulsars). 
Observations (Kramer, 1998; Weisberg and Taylor, 2002) indicated that the pulse profile is 
varying with time, which suggested that the pulsar is undergoing geodetic precession on a 
300-year timescale. The amount was consistent with GR, assuming that the pulsar’s spin is 
suitably misaligned with the orbital angular momentum. Unfortunately, the evidence also 
suggested that the pulsar beam may precess out of our line of sight by 2025. 

A better test was provided by the “double pulsar system,” JO737—3039, discovered 
in 2003 (see Section 12.1.2). Because the orbit is seen to be almost edge-on, the radio 
signal from the primary pulsar A is partially eclipsed by the magnetosphere of pulsar B. 
By monitoring the long-term evolution of the amplitude variations of pulsar A’s signal 
during eclipses and building a simple model of the magnetosphere of pulsar B, Breton 
et al. (2008) were able to measure the orientation of both the spin and magnetic axes of 
pulsar B relative to the orbital plane, as well as the rate of precession of the spin axis. Since 
both masses are known along with the orbital period and eccentricity (see Table 12.2), we 
can use Eq. (9.11a) to predict a value Qygeo} = 5.07 deg yr. The value inferred from the 
model was 4.7 + 0.7 deg yr~', a 13% test in agreement with general relativity. In fact the 
precession is so significant that in 2008 the radio beam of pulsar B went out of sight. 


9.1.3 Tests of spin effects on orbits 


In Section 9.1.1, we saw that spin-orbit and spin-spin interactions can induce precessions 
of orbital planes and of orbital pericenters. In the solar system, the effect of the spin of the 
Sun on most orbits is too small to be detectable; the best hope might be Mercury, where 
the additional contribution to the perihelion advance due to the Sun’s spin is ~ 3 x 1073 
arcseconds per century, of the same order as the current measurement uncertainties. 

On the other hand, the effect has been measured using an array of Earth-orbiting 
satellites, tracked to high precision using laser ranging. For a satellite with negligible mass 
and spin, a = 0, so the vector C that appears in Eq. (9.7) is given by 


i 
C= 5 (47+ 44+0)Se. (9.13) 


9.1 Testing the Effects of Spin 


In a reference system in which Sg points in the z-direction, Cg = 0, C, = Csinv and 
Cy, = Ccosv. Thus from Eq. (9.6), the inclination of the orbit relative to the Earth’s spin 
axis stays unchanged, while the line of nodes and the pericenter advance at the rates 


dQ 1 So 
ae —_— _ 4 +4 | 
( dt ., 4 (4 + a) a3(1 — e2)3/2 


1 Ro\> 0.218 - 
=5 (4 +4401) ( ; ) (1 — 2) as yr, 
dw 1 Se 
(G)_- 3 (47 +4 1) Ba ay COS L 
1 Re\> 0.655 4 
— (47 +4 a1) ( : ) (1 — e)3 as yr, (9.14) 


where we use the fact that w = cz — Q.cos v. Unfortunately, these effects are tiny compared 
to the effects of the Newtonian multipole moments of the Earth, given by 
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where J; is the quadrupole moment coefficient of the Earth, given by J; = 1.0864 x 107°. 
Notice that the Newtonian nodal advance rate is proportional to cosv, hence it vanishes 
for a polar orbit; for two orbits with the same orbital elements but with supplementary 
inclinations (12 = 7 — 11), the nodal advances are equal and opposite. This dependence 
is true for all the even multipole moments J4, Js and so on. Thus in order to measure the 
relativistic precession, one has to measure or suppress the Newtonian effect to extremely 
high accuracy. 

Van Patten and Everitt (1976) proposed measuring the frame dragging effect using 
two satellites in counter-rotating polar orbits. The Newtonian nodal precession would be 
suppressed, at least up to the effects of the errors Av; and Azz involved in achieving 
true polar orbits, and those errors could be measured to sufficient precision by having 
the satellites track each other as they passed one another over the poles. Ground tracking 
would measure the nodal precession, and the contribution of the Newtonian precession 
could be subtracted. This proposal never came to fruition. 

The 1976 launch of the “Laser Geodynamics Satellite’ (LAGEOS) and of a second 
LAGEOS in 1992 changed the situation. These satellites are massive spheres, 60 cm in 
diameter and weighing about 400 kg, placed in nearly circular orbits with semimajor axes 
approximately equal to 2Rg (the precise orbital elements are listed in Table 9.2). The 
spheres are covered with laser retroreflectors, similar to those used for lunar laser ranging. 
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Table 9.2 Orbital elements of laser-ranged satellites. 


Semimajor Orbital Inclination 
Satellite axis(km) period(min) Eccentricity to equator (°) 
LAGEOS I VA DST DS 0.0045 109.84 
LAGEOS II 12,168 228 0.0135 52.64 
LARES 7,821 115 0.0007 69.5 


Because of their large mass-to-area ratio, the spheres are less affected by atmospheric drag 
than other satellites at similar altitudes. This, combined with the high precision of laser 
ranging (which routinely achieves millimeter-level precision), means that their orbits can 
be determined extremely precisely. The LAGEOS satellites were launched primarily to 
carry out studies in geodesy and geodynamics, but it was soon recognized that they were 
potentially capable of measuring the relativistic nodal advance, which for the values for 
LAGEOS I shown in Table 9.2 amounts to approximately 30.7 mas/yr. 

Indeed, Ciufolini (1986) pointed out that if a second LAGEOS satellite were to have 
an inclination supplementary to that of LAGEOS I, then the Newtonian advance would 
be equal and opposite for the two satellites, so that if one measured the nodal advance of 
both satellites simultaneously and added them together, the large Newtonian effect would 
cancel out, up to the contributions of errors in achieving the precise orbital inclinations. 

Ciufolini and other relativists campaigned vigorously to have LAGEOS II launched with 
to = 70.16°, but other considerations prevailed in the end. LAGEOS II was launched with 
lo = 52.64°, mainly to optimize coverage by the world’s network of laser tracking stations, 
which was important for geophysics and geodynamics research. The fall-back option was 
then to combine the data from the two satellites as they were. One could still eliminate 
the largest Newtonian contribution coming from J with a suitable linear combination of 
the two measured nodal advances, thereby revealing the relativistic contribution and those 
coming from higher-order multipole moments. The uncertainties in the values of those 
moments would contribute to the error made in measuring the relativistic effect. For a time, 
Ciufolini and collaborators tried to include a third piece of data, the perigee advance of 
LAGEOS II (which has a small eccentricity), as a way to also eliminate J4, but this turned 
out to be plagued with systematic errors that were large and hard to control (Ciufolini et al., 
1997, 1998; Ciufolini, 2000). 

The turning point came with CHAMP and GRACE. Europe’s CHAMP (Challenging 
Minisatellite Payload) and NASA’s GRACE (Gravity Recovery and Climate Experiment) 
missions, launched in 2000 and 2002, respectively, use precision tracking of spacecraft to 
measure variations in the Earth’s gravity on scales as small as several hundred kilometers, 
with unprecedented accuracies. GRACE consists of a pair of satellites flying in close 
formation (200 kilometers apart) on polar orbits. Each satellite carries an on-board 
accelerometer to measure nongravitational perturbations, a satellite to satellite K-band 
radar link to measure variations in the Earth’s gravity gradient on short scales, and a 
GPS tracking unit to measure larger scale variations. With the dramatic improvements 
on J; obtained by CHAMP and GRACE, Ciufolini and his colleagues could now treat Jy 
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and higher multipole moments as known, and use the two LAGEOS nodal advances to 
determine Jy and the relativistic contribution. The final outcome was a successful test of 
the relativistic prediction at the level of 10 percent (Ciufolini et al., 2011). 

On February 13, 2012, a third laser-ranged satellite, known as LARES (Laser Relativity 
Satellite) was launched by the Italian Space Agency. Its inclination was 69.5°, very close 
to the required supplementary angle relative to LAGEOS I, and its eccentricity was very 
nearly zero (see Table 9.2). However, the launch failed to achieve the same semimajor 
axis as that of the two LAGEOS satellites, again preventing a perfect cancellation of the 
Newtonian effect. Nevertheless, combining data from all three satellites with continually 
improving Earth data from GRACE, the LARES team reported a test of frame-dragging at 
the 5 percent level (Ciufolini et al., 2016). 


9.2 De Sitter Precession 


This effect, an advance of the line of nodes of the Earth—Moon orbit, was first calculated 
by de Sitter (1916) as a consequence of the post-Newtonian perturbations of the lunar orbit 
induced by the gravitational potential of the Sun and by the orbital motion of the Earth— 
Moon system around the Sun (for details of the derivation in general relativity in modern 
language, see section 10.1.6 of PW). However, it can equally well be regarded as a geodetic 
precession of the Earth-Moon system’s angular momentum vector (its “spin”) about the 
angular momentum vector of the Earth-Sun orbit. From the geodetic term in Eq. (9.3), the 
change in Sgm, the Earth—-Moon’s spin, over one orbit around the Sun is given by 


NS Sey 1) Bens, (9.16) 
P 


where we have approximated m; = mg +mg <m, and mz = me ~ m. This precession 
leads directly to an advance of the lunar node, given by 


dQ 1 ee 1 


dt =a ) BP 1—e? 


= 1.917as/century , (9.17) 
where a and e are the semimajor axis and eccentricity of the Earth-Moon barycenter orbit 


around the Sun. This effect has been measured to about 0.6 percent using lunar laser 
ranging data (Dickey et al., 1994; Williams et al., 2004a, 2004b). 


9.3 Tests of Post-Newtonian Conservation Laws 


——>>>>>E—————————— ee ee 


Here we discuss tests of the “conservation law” PPN parameters 1, C2, ¢3, C4, and a3. We 
have already seen in Section 8.4 that, by virtue of its additional role as a preferred-frame 
parameter, a3 has been constrained to vanish to a few parts in 10° using data from wide- 
binary millisecond pulsars. 
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In addition, there is strong theoretical evidence that ¢4, which is related to the gravity 
generated by fluid pressure, is not really an independent parameter. In any reasonable 
theory of gravity there should be a connection between the gravity produced by kinetic 
energy (pv), internal energy (pI), and pressure (p). The idea is to consider an alternative 
PPN metric generated by a system of charged point masses, with gravitational potentials 
generated by masses, microscopic velocities, charges, and so on. Under suitable coarse 
graining as used in thermodynamics, this metric should transform smoothly to the 
hydrodynamic PPN metric. From such considerations, there follows (Will, 1976b) the 
additional theoretical constraint 


6) = Jae 25/36: (9.18) 


In addition the ratio of active to passive mass for such a bound system of point charges 
should take the form 


=14+=G 


wl 
mp 2 te) 


mp 
where E, is the electrostatic energy of the system of point charges. 

A nonzero value for ¢; would result in a violation of conservation of momentum, or of 
Newton’s third law in gravitating systems, based on differences in chemical composition. 

A classic test of Newton’s third law for gravitating systems was carried out by Kreuzer 
(1968), in which the gravitational attraction of fluorine and bromine were compared. 
Kreuzer’s experiment used a Cavendish balance to compare the Newtonian gravitational 
force generated by a cylinder of Teflon (76 percent fluorine by weight) with the force 
generated by that amount of a liquid mixture of trichloroethylene and dibromomethane 
(74 percent bromine by weight) that had the same passive gravitational mass as the 
cylinder, namely the amount of liquid displaced by the cylinder at neutral buoyancy. In 
the actual experiment, the Teflon cylinder was moved back and forth in a container of 
the liquid, with the Cavendish balance placed near the container. Had the active masses 
of Teflon and displaced liquid differed at neutral buoyancy, a periodic torque would have 
been experienced by the balance. The absence of such a torque led to the conclusion that 
the ratios of active to passive mass for fluorine and bromine are the same to 5 parts in 10°. 
Using the semi-empirical mass formula for nuclear electrostatic energies (which are the 
dominant contribution in this case), E./m ~ 7.6 x 10~4 Z(Z— 1)A~4/3, where Z and A are 
the atomic number and mass number, respectively, we obtain the bound |¢3| < 0.06. 

An improved bound was reported by Bartlett and van Buren (1986). They noted that 
current understanding of the structure of the Moon involves an iron-rich, aluminum-poor 
mantle whose center of mass is offset about 10 km from the center of mass of an aluminum- 
rich, iron-poor crust. The direction of offset is toward the Earth, about 14° to the east of 
the Earth—Moon line. Such a model accounts for the basaltic maria that face the Earth, and 
the aluminum-rich highlands on the Moon’s far side, and for a 2 km offset between the 
observed center of mass and center of figure for the Moon. Because of this asymmetry, 
a violation of Newton’s third law for aluminum and iron would result in a momentum 
nonconserving self-force on the Moon, whose component along the orbital direction 
would contribute to the secular acceleration of the lunar orbit. Improved knowledge of 
the lunar orbit through lunar laser ranging, and a better understanding of tidal effects 
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in the Earth-Moon system (which also contribute to the secular acceleration) through 
satellite data, severely limit any anomalous secular acceleration, with the resulting limit 
of 4 x 10~!? on the difference between the active-to-passive mass ratios for aluminum and 
iron. This leads to the bound |¢3| < 10~8. Nordtvedt (2001) examined whether this bound 
could be improved by considering the asymmetric distribution of ocean water on Earth; 
the self-acceleration of the Earth in the polar direction would then lead to an effect in the 
Earth—Moon orbit that could be tested by lunar laser ranging. 

Another test of PPN conservation laws involves an effect first pointed out, incorrectly, by 
Levi-Civita (1937). The effect is the secular acceleration of the center of mass of a binary 
system. Levi-Civita pointed out that general relativity predicted a secular acceleration in 
the direction of the periastron of the orbit, and found a binary system candidate in which 
he felt the effect might one day be observable. Eddington and Clark (1938) repeated the 
calculation using the N-body equations of motion of de Sitter (1916). After first finding a 
secular acceleration of opposite sign to that of Levi-Civita, they then discovered an error 
in de Sitter’s equations of motion, and concluded finally that the secular acceleration was 
zero. Robertson (1938) independently reached the same conclusion using the Einstein- 
Infeld-Hoffmann equations of motion, and Levi-Civita later verified that result. 

In fact, the center-of-mass acceleration does exist, but only in nonconservative theories 
of gravity (Will, 1976a). The simplest way to derive this result is to treat the two-body 
system as a single composite “body” in otherwise empty space, and to focus on the self 
acceleration terms in the equation of motion (6.46a). Setting a3 = 0, dropping the fluid 
terms €/ and P’, specializing the remaining terms to two bodies and then averaging over 
one orbit, we obtain 


m2 e 


1 
tom =~ 703A pape (9.20) 


where ep is a unit vector in the direction of the pericenter of body 1, 7 = mjm2/(m,+myz)* 
and A = (mz—m,)/(m,+m2). A consequence of this acceleration would be non-vanishing 
values for d’?P/dt*, where P denotes the period of any intrinsic process in the system 
(orbital period, spectral frequencies, pulsar periods). The observed upper limit on d’P,, /d?? 
of the binary pulsar B1913+16 places a strong constraint on such an effect, resulting in the 
bound |¢3| < 4 x 107 (Will, 1992b). 

No feasible experiment or observation has ever been proposed that would set a direct 
limit on the PPN parameters ¢; or C4. Note, however, that these parameters do appear 
in combination with other PPN parameters in observable effects, for example, ¢; in the 
Nordtvedt effect (see Section 8.1). 
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The landscape for testing gravitational theory underwent a dramatic upheaval in September 
1974, with the discovery by Hulse and Taylor of the Binary Pulsar (see Section 12.1 for 
an account of the discovery). Until then, all tests of relativistic gravitation were carried out 
under conditions of weak gravitational fields and slow motions, and were well described 
by the ppn framework of Chapter 4. Tests involving the orbits of bodies, such as those of 
Mercury and other planets around the Sun, could be safely studied by treating the planets 
as test bodies and calculating their geodesic motion; the notable exception to this was 
the Nordtvedt effect in lunar motion. And no tests prior to 1974 even remotely involved 
gravitational radiation. 

The discovery of the binary pulsar B1913+16 changed all that. Here was a system 
consisting of a neutron star and a compact companion star of almost equal mass; each star 
moves in a curved spacetime generated both by the companion star and by itself, and both 
stars are moving at substantial speeds. Neutron stars are not weak-field objects; the self- 
gravitational binding energy of a neutron star can be as much as 20 percent of its rest mass. 
Thus the ppn framework no longer applies, or at least must be applied with great caution. 
And within a few weeks of the announcement of the discovery, it was realized that, with 
orbital velocities of order 200km s~!, the system was so relativistic that the dissipative 
effects on its orbit resulting from the emission of gravitational radiation might eventually 
be detectable. And indeed, by 1979, the inspiral of the orbit had been measured in an 
amount that agreed with the prediction of general relativity, providing the first evidence of 
the existence of gravitational radiation. 

In this chapter and the two chapters that follow, we will focus on this new landscape for 
testing relativistic gravity. This chapter will deal with the structure and motion of bodies 
with strong internal gravity, such as neutron stars and black holes; “compact body” has 
become the standard term for such astronomical objects. This is known as the “strong- 
field” regime of gravity since the spacetime geometry inside and near such objects is 
characterized by m/r ~ 1, as compared with 10~° near the solar surface. Chapter 11 will 
treat gravitational radiation in general relativity and alternative theories of gravity. This is 
frequently termed the “dynamical” regime of gravity, where motions are sufficiently rapid 
that the gravitational waves generated can be significant, both for direct detection, and for 
the back-reaction on the evolution of the system. Using the basic frameworks developed 
in these two chapters, we will describe in Chapter 12 specific strong-field and dynamical 
tests of relativistic gravity, including tests involving binary pulsars, tests using gravitational 
waves from inspiralling binaries of compact bodies, such as the waves detected by the 
LIGO/Virgo observatories beginning in 2015, and tests involving orbits of stars and matter 
outside black holes. 
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We will begin by discussing the structure of neutron stars and black holes in general 
relativity and in a range of alternative theories in Sections 10.1 and 10.2, and then will turn 
to their motion in Section 10.3. 


10.1 Structure of Neutron Stars 
ee 


Neutron stars were first suggested as theoretical possibilities in the 1930s (Baade and 
Zwicky, 1934). They are highly condensed stars where gravitational forces are sufficiently 
strong to crush atomic electrons together with the nuclear protons to form neutrons, to 
raise the density of matter above nuclear density (p > 3 x 10'* g cm~>), and to cause the 
neutrons to be quantum-mechanically degenerate. At such high densities, exotic phenom- 
ena can occur, including superfluidity, strange quark matter, and quark-gluon plasmas. A 
typical neutron-star model has a mass between 0.7 and 2.5 Mo, and a radius of order 10 km. 
However, they remained just theoretical possibilities until the discovery of pulsars in 1967 
(Hewish et al., 1968) and their subsequent interpretation as rotating neutron stars. There 
followed significant progress in calculating the structure of neutron stars using different 
nuclear equations of state, in obtaining firm estimates for the maximum mass of neutron 
stars, and in developing robust methods for modelling rapidly rotating neutron stars. 

Today, with over 2300 pulsars known (Lorimer and Kramer, 2012), neutron stars are 
fully accepted as members of the astronomical zoo, and indeed comprise a number of 
different classes. There is the class of mostly isolated pulsars of periods between 0.1 and 
10 s, representing the bulk of the population. The class of “recycled” pulsars with periods 
between one and 100 milliseconds, found mostly in binary systems, includes objects whose 
spin periods are so stable that they rival the best Earth-bound atomic clocks, yielding 
methods both for testing general relativity and for detecting gravitational waves. The class 
of accreting, X-ray emitting neutron stars has provided laboratories for studing the behavior 
of matter and magnetic fields under extreme conditions. The statistics of neutron stars 
also provide constraints on models of supernovae and on the evolution of binary systems 
containing massive stellar progenitors. Pulsars are regarded as important laboratories for 
testing theories of the equation of state of matter at ultrahigh densities. They could also be 
useful for testing alternative theories of gravity, and this section is devoted to discussing 
the structure of these objects in a range of theories. 

In Newtonian gravitation theory, the equations of stellar structure for a static, spherically 
symmetric star composed of matter at zero temperature are given by 


’ = ae , [Hydrostatic equilibrium], 
r 
P=D(p); [Equation of state] , 
¢ G =) App, ‘Neild equniony, (10.1) 
r 


where p(r) is the pressure, p(r) is the density and U(r) is the Newtonian gravitational 
potential. Note that 7 = 0 (more precisely, 7 << the Fermi temperature of the degenerate 
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matter in the star) is an adequate approximation for neutron-star matter for all but newly 
formed hot neutron stars; this results in the simple equation of state p(p). 

In any metric theory of gravity, it is simple to write down the equations corresponding 
to the first two of these three equations because they follow from the Einstein Equivalence 
Principle (Chapter 2), which states that in local freely falling frames the nongravitational 
laws of physics are those of special relativity, 7” , = 0, p = p(p). Thus, we have in any 
basis, 


r,=0, p=plo). (10.2) 


For a perfect fluid, 7!” = (p + p)u“u” + pg”, where we have lumped the internal energy 
pl into p (compare Eq. (3.77)). 

It is useful to rewrite these equations in a form that parallels the first two parts of 
Eq. (10.1). For a static, spherically symmetric spacetime, there exists a coordinate system 
in which the metric has the form 


ds? = — 229) dP 4 PAM |G? 4 PHO PaO? , (10.3) 


where dQ? = dO? + sin*@d¢? is the element of the two-sphere. For theories of gravity 
with a preferred frame, established by an auxiliary vector or tensor field, for example, the 
assumption of spherical symmetry is only valid for stars at rest with respect to the preferred 
frame. There exists the freedom to change the coordinate r by r = g(7). If the radial 
coordinate is chosen so that p(r) = 0, the coordinates are called ‘curvature coordinates.” 
In such coordinates, 27r measures the proper circumference of circles of constant r. In 
general relativity, they are known as Schwarzschild coordinates. If the radial coordinate is 
chosen so that j(7) = A(r), the coordinates are called “isotropic coordinates.” 

Now, for hydrostatic equilibrium, the equations of motion 7’”.,, = 0 may be written in 
the form 


os oe —(p + p)u’ uj, > (10.4) 


where j runs over r, 8, and @. For spherical symmetry only the 7 = r component is 
nontrivial, and using the fact that u = (e~®, 0, 0, 0), we obtain 


ma ==() +p) 2 
Notice that in the Newtonian limit, p < p, ® ~ —U and we recover the first two of 
Eqs. (10.1). Eq. (10.5) are valid independently of the theory of gravity, assuming static 
spherical symmetry. The field equations for ®, A, and yu, along with field equations for any 
auxiliary fields, will depend upon the theory. 
In constructing a stellar model, boundary conditions must be imposed. One set imposes 
a continuity condition for the matter, and defines the stellar surface at radius R: 


p=pP(p). (10.5) 


dp 

— = 0; R)=0. 10.6 

dls P(R) (10.6) 
Assuming that the spacetime far from the neutron star can be approximated as asymptot- 
ically flat, the metric functions are constrained to behave as r — oo according to [see 


Eq. (5.6)] 
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These conditions guarantee that after transforming to asymptotically Lorentz coordinates, 
and converting to geometrized units (Gioday = 1), the metric will have the form 


Zo > -1+ ~ 8y >0, gx > On, (10.8) 


and thus that the Kepler-measured mass of the star will be m. The asymptotic boundary 
conditions on the auxiliary fields of the theory are assumed to take the form, as r — 00, 
and in the preferred rest frame (if any): 


o(r) > $0, 
K"(r) + (K, 0, 0, 0), 
BY’ (r) —+ diag(bo, b1, bor”, bar’ sin?6) . (10.9) 


These boundary values ¢o, K, b; could depend on time through cosmological evolution, or 
they could depend on the presence of nearby matter, such as a companion star in a binary 
system. The presence of nearby matter could affect the internal structure of the neutron star 
because of violations of the strong equivalence principle (see Section 3.3 for discussion). 
If the timescale over which these parameters change is long compared to the internal 
dynamical timescale of the neutron star, one can still approximate the star’s structure as 
quasistatic, with its structure evolving adiabatically as the external parameters vary. If the 
timescales are comparable, such as near the end of an inspiral of two compact bodies, then 
one must resort to a fully dynamical calculation. 

Another problem that must be confronted in building neutron star models in alternative 
theories of gravity is that real neutron stars are in motion because of the motion of the host 
galaxy relative to the cosmic background radiation, and in some cases because of orbital 
motion around a companion. This means that in theories of gravity with auxiliary vector or 
tensor fields, one cannot automatically assume spherical symmetry, since these fields will 
single out a preferred direction in space from the point of view of the neutron star’s rest 
frame. Depending on the theory, it may be possible to treat these nonspherical effects as 
small perturbations of an underlying spherical model. 

For a range of nuclear equations of state, one then can determine such important 
quantities as the mass-radius relation, the maximum mass, the surface gravitational 
redshift, the innermost stable circular orbit, and, for rotating neutron stars, the moment 
of inertia, the quadrupole moment and the Love numbers. These astrophysically relevant 
parameters could then be compared with available data in order to place constraints on the 
alternative theory (as well as on the equation of state). 

Another important set of parameters, known as “sensitivities,” encodes the degree to 
which such quantities as the mass, radius and moment of inertia vary when the parameters 
oo, K, and 6; vary. These quantities will play an important role in the equations of motion 
for compact bodies in alternative theories of gravity (see Section 10.3). 

Unfortunately, this exhausts the generic features of the equations of relativistic stellar 
structure, so we must now turn to specific theories. Here we describe neutron stars in only 
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a sampling of theories of gravity. Berti et al. (2015) give a more thorough review, covering 
a wide range of alternative theories. 


General relativity 


In curvature coordinates (u(r) = 0), and in units where G = 1, the field equations (5.13) 
take the form 
d 
i [r(1 — | = 82r'p, 
Fr 
db = m(r) + 4rrp 


dr r(r—2m(r)) oe 
The first equation has the solution 
eh = (: = zu : (10.11) 
r 
where 
m(r) = 41 [ 1? o(r')dr' . (10.12) 
0 


These equations, together with Eq. (10.5) and the boundary conditions (10.7) (with 
co = c; = 1), are sufficient to calculate neutron-star models. Called the TOV (Tolman, 
Oppenheimer, Volkoff) equations for hydrostatic equilibrium, they are the standard tool 
for the study of nonrotating neutron stars within general relativity. For reviews, including 
models of rotating neutron stars, see Arnett and Bowers (1977), Baym and Pethick (1979), 
Salgado et al. (1994), Cook et al. (1994), Stergioulas (2003) and Lattimer and Prakash 
(2007). The maximum masses of neutron star models range from around 1.4 Mo for soft 
equations of state to 2.7 Me for stiff equations of state. The discovery of pulsars with 
masses in excess of 2M, coupled with the assumption that general relativity is correct, 
has made it possible to rule out a number of soft equations of state. 


Scalar-tensor theories 


Using curvature coordinates (u(r) = 0), and defining 


-1 
ee [ = =m] , (10.13) 


r 


we can put the field equations (5.35) for scalar-tensor theories into the form 


dm 4nGr° p—3p 1 2m o, 2w'd 
= p 1 ® r r ; W 5) 
dr @ 3+2w 81G r ees 26 3+ 2w 


d® 1 E +41Grp/¢ ; wr,” “| 


dr (1+r¢,/26)| r(r—2m) | 4¢ ob 


d (20-n46\ eA T8xGr(p—3p) 
4 (;age-a8) _ ie | 
dr ( . dr 3+2w r—2m Wy |: (10.14) 
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where G is the gravitational coupling constant, and w’ = dw/d@. The present value of G is 
related to the asymptotic value of ¢ by (see Eq. (5.54)) by 


_ G (4+20(60)\ _ 
Gtoday = bo (; a =e) =1. (10.15) 


For the special case of Brans-Dicke theory (w = constant), neutron star models differ only 
slightly from those of general relativity for almost all values of w > 0 (Salmona, 1967; 
Hillebrandt and Heintzmann, 1974). For general scalar-tensor theories with w’ positive and 
with relatively large values of wo, the differences are typically of order 1/wo (Zaglauer, 
1992). However, within a class of scalar-tensor theories for which A(y) = oboe? /2 [see 
Eq. (5.38)], and for negative values By = 2wj¢0/(3 + 2wo)? < —4.85, a phenomenon 
known as “spontaneous scalarization” can occur, whereby nonlinear interactions between 
gravity and the scalar field can lead to large deviations of the scalar field inside the star from 
its values outside, resulting in dramatically different neutron-star structure (Damour and 
Esposito-Farese, 1993, 1996, 1998). For a semi-analytic analysis of neutron-star structure 
in scalar-tensor theories, along with an extensive bibliography, see Horbatsch and Burgess 
(2011). 

Neutron star models have also been studied in various versions of f(R) and chameleon 
scalar-tensor theories (Kobayashi and Maeda, 2008; Upadhye and Hu, 2009; Babichev and 
Langlois, 2009, 2010; Cooney et al., 2010; Jaime et al., 2011). 


Vector-Tensor Theories 


Neutron-star models in Einstein-4Ether theory were studied by Eling et al. (2007), 
extending earlier work by Eling and Jacobson (2006). The star was assumed to be isolated 
in an asymptotically flat spacetime and at rest relative to the preferred frame induced by 
the vector field. They first imposed the constraints of Eq. (5.65), so that aj = a2 = 0. 
They restricted the parameters further by requiring that propagating modes of gravitational 
waves be stable, carry positive energy and have speeds > 1. They assumed that the vector 
field had only a time component (K, = 0). This is not strictly required by the assumption 
of a static system, so it may be overly restrictive. Since the vector field in Einstein-4Ether 
theory has unit norm, this implies that K°(r) = e~®™. In static spherical symmetry with 
K, forced to vanish, it turns out that the constants cz and c3 play no role, and that the only 
relevant parameter is c4, or equivalently the combination c;4. Recall from Chapter 5 that 
0 < ci4 < 2. Eling et al. (2007) found, for example that the maximum mass was in general 
a monotonically decreasing function of cj4, ranging from about 2.4 Mo for ci4 = 0 to 
about 1.8 Me for cj4 = 1.9 for a stiff hadronic equation of state, and from 2.2 Mj to about 
1.7 Me for a medium hadronic equation of state. For any MIT bag model equation of state 
with strange quarks, and for a soft hadronic equation of state, the maximum mass was less 
than 2 Mz for all values of c)4. The observation of neutron stars with masses > 2 Me thus 
constrains both the equation of state and the parameter c)4. Yagi et al. (2014a) extended 
this to a broad range of equations of state compatible with 2Mo neutron stars, and also 
studied models in Khronometric theory. 
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Tensor-Vector-Scalar Theories 


Neutron stars in TeVeS were first studied by Lasky et al. (2008). While that analysis used 
the original flawed version of TeVes (Skordis, 2008), Lasky (2009) showed that, for static 
spherical symmetry, with the constraint K,. = 0, the equations used are equivalent to those 
obtained from the improved TeVeS theory, as described in Section 5.5. The models depend 
on the asymptotic value ¢o of the scalar field (0 < 49 < 1), on the vector coupling constant 
c,4, and on the scalar coupling constant k ~ 0.03. They used a polytropic equation of state 
with [ = 2.46, which leads to a maximum mass in general relativity of only 1.65 Mo 
(this was before the discovery of a 2 Mj neutron star). As in Einstein-Ather theory, the 
maximum mass was found to be a decreasing function of c)4, holding k and @o fixed. For 
c14 = 0.5 and dp = 0.003, the maximum mass decreased slightly with increasing &. In the 
limit of small values of the three parameters c)4, k and ¢o, the models approached their 
general relativistic counterparts. 


Quadratic gravity theories 


Pani et al. (2011) studied neutron star models in a class of quadratic theories described by 
the action 


=| Lacon + fillol)Rap +fallol) RR 


— (|d))e”8,6"0,0 — ro)| (—2)'ax 
+ Ino(q4,4’(|A|)uv)» (10.16) 


where ian is the Gauss-Bonnet invariant, given by 
Rag = R? — 4RogR™ + RopysRO™, 10.17 
GB B By 


and ¢ is potentially a complex scalar field. The use of R2,, in the action (10.16) guarantees 
that the field equations contain no more than two derivatives of the metric (compare with 
Eq. (5.77)). They then specialized to Einstein-Dilaton-Gauss-Bonnet gravity, with ¢ real, 
fo = (167G)~!, and V(¢) = 0. In spherical symmetry, the Chern-Simons term *RR makes 
no contribution, so f4 can be set equal to zero. Finally, they used a parametrization fj (¢) = 
(a/167)e®? where a and G are constants, with a having dimensions of (length)”. They 
chose boundary conditions in which the metric was asymptotically flat, and in which ¢ + 0 
as r — oo. They found that ¢ < | throughout the neutron star, and that the maximum mass 
was a decreasing function of the product a. For a reasonably stiff equation of state, the 
maximum mass ranged from 2.4 Mq for small values of a3, corresponding to the general 
relativistic limit, to 0.6Me for aB = 100M. Recall that the “natural” scale for a is 
Mo ranck = 10-*Mp. 

While spherical neutron stars are not affected by the Chern-Simons term, rotating 
neutron stars are. Slowly rotating neutron star models in dynamical Chern-Simons theory 
were constructed by Yunes et al. (2010), Ali-Haimoud and Chen (2011), and Yagi et al. 
(2013). Unfortunately, modifications to the maximum mass were found to be degenerate 
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with the equation of state. The stars develop a “dipole scalar charge” that can affect orbital 
motions in binary pulsars, but the effects were found to be too small to provide meaningful 
constraints on the theory (Yagi et al., 2013). 


10.2 Structure of Black Holes 
Saaz EEE ee ES eee 


In a certain sense, black hole theory has a longer history than neutron-star theory, as it dates 
back to a suggestion by the Reverend John Michell (1784) that, according to Newtonian 
gravity, objects might exist in which the escape velocity from the surface could exceed the 
speed of light. Laplace made the same suggestion independently in 1796 (Laplace, 1808, 
1799). Remarkably, Michell went on argue that such a body, though invisible, could be 
detected by measuring the motion of a companion star that might be in orbit around it. 
Michell had already used a sophisticated statistical analysis of William Herschel’s data on 
close associations of stellar pairs and triples to conclude that some of them must be self- 
gravitating systems. It would take 20 more years for Herschel to confirm the existence of 
binary stars. For a history of the ideas of Michell and Laplace concerning black holes, see 
Montgomery et al. (2009). 

Within general relativity, two key events in the history of black holes were the discovery 
of the Schwarzschild metric (Schwarzschild, 1916) and the analysis of gravitational 
collapse across the Schwarzschild horizon (Oppenheimer and Snyder, 1939). However, 
theoretical black hole physics really came into its own with the discovery of the true nature 
of the “event horizon” of Schwarzschild’s solution (Finkelstein, 1958; Kruskal, 1960) and 
the discovery in 1963 of the Kerr metric (Kerr, 1963), now known to be the unique solution 
for a stationary, vacuum, rotating black hole (with the Schwarzschild metric being the 
limiting case corresponding to no rotation). Other important developments in the theory 
of black holes include the full understanding of event horizons as nonsingular, causal 
boundaries between an exterior spacetime and an interior from which there is no escape, 
the establishment of black-hole uniqueness and stability, the formulation of singularity 
theorems, the development of black-hole accretion theory, and the discovery of black-hole 
thermodynamics and Hawking radiation. For a history, see Israel (1987). 

It was the discovery in 1971 of the rapid variations of the X-ray source Cygnus X-1 
by telescopes aboard the UHURU satellite that took black holes out of the realm of pure 
theory (Bolton, 1972; Webster and Murdin, 1972). The source of X-rays was observed to 
be in a binary system with the companion star HDE 226868; analysis of the nature of the 
companion and of its orbit around the X-ray source, and detailed study of the X-rays, led 
to the conclusion that the unseen body was a compact object with a mass exceeding 9 Mo. 
Since the maximum masses of white dwarfs and neutron stars are approximately 1.4 Mo 
and ~ 3 Mo, respectively, the simplest conclusion was that the object was a black hole. 
The source of the X-rays was believed to be the hot, inner regions of an accretion disk 
around the black hole, formed by gas stripped from the atmosphere of the companion star. 
Since 1971, many black hole candidates in X-ray binary systems have been found (Celotti 
et al., 1999). 
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Studies of the central regions of quasars, galaxies and globular clusters have indicated 
that supermassive black holes may be ubiquitous (see Kormendy and Richstone (1995) for 
an early review); indeed, every massive spiral galaxy appears to contain a central massive 
black hole. There is even a 4.3 x 10° Mg black hole in the center of the Milky Way, the pre- 
cise measurement of its mass being made possible by observations of stars in orbit around it 
(Eckart and Genzel, 1996; Ghez et al., 1998). Although the masses of such black holes are 
trivial compared to the total mass of their host galaxies, their apparent ability to generate 
extraordinary mass and energy outflow from accreting matter can have important feedback 
effects on the structure of the galaxy, including such phenomena as star formation. 

The capstone to the remarkable history of black holes is the September 14, 2015 
detection by LIGO of gravitational waves from the inspiral and merger of two black holes 
of masses 29 and 36 Mj (Abbott et al., 2016c). 

General relativity predicts the existence of black holes. Black holes are the end products 
of catastrophic gravitational collapse in which the collapsing matter crosses an event 
horizon, a surface whose radius depends upon the mass and angular momentum of matter 
that has fallen across it, and which is a one-way membrane for timelike or null world 
lines. Such world lines can cross the horizon moving inward but not outward. The 
interior of the black hole is causally disconnected from the exterior spacetime. There 
is considerable evidence to support the claim that any gravitational collapse involving 
sufficient mass, whether spherically symmetric or not, with zero net charge and zero net 
angular momentum, results in a black hole whose metric (at late times after the black hole 
has become stationary) is the Schwarzschild metric, given by 


-1 
ds? = (: “at dt’ 4 (1 *) dr’ +1 (d0” + sin’0d¢”) , (10.18) 
where M is the mass of the black hole. If the collapsing body has net rotation, the black 
hole is described by the Kerr metric. 

While curved spacetime is essential to the existence of event horizons as one-way 
membranes for the physical interactions, the existence of black holes is not an automatic 
byproduct of curved spacetime, but depends on the specific theory of gravity. 

The general approach for finding black hole solutions in a given theory is the same as for 
neutron stars, except that one assumes vacuum everywhere. In general relativity, solving 
Egs. (10.10) with p = p = 0 leads immediately to Eq. (10.18). To see ifa particular vacuum 
solution corresponds to a black hole, one must check whether a non-singular event horizon 
exists, and whether any true physical singularities in the spacetime are safely hidden behind 
the event horizon. In some theories, different fundamental speeds or “null cones” may be 
associated with different fields (see Section 11.3), and thus there may be more than one 
“event” horizon. 

Once again, we provide a mere sampling of black hole structure in alternative theories 
of gravity; see Berti et al. (2015) for a more thorough review, including various attempts 
to “parametrize” black holes in a manner similar to the PPN formalism. 


Scalar-tensor theories 


As one might expect, scalar-tensor theories, being in some sense the least violent 
modification of general relativity, predict black holes. However, what is unexpected is 
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that they predict black holes whose geometry is identical to the Schwarzschild geometry 
of general relativity. Roger Penrose was probably the first to conjecture this, in a 1970 
talk at the Fifth Texas Symposium on Relativistic Astrophysics. Motivated by this remark, 
Thorne and Dykla (1971) showed that during gravitational collapse to form a black hole, 
the Brans-Dicke scalar field is radiated away, leaving only its constant asymptotic value 
and a Schwarzschild black hole. Hawking (1972) argued on general grounds that the scalar 
field ¢ must be constant throughout the exterior of the horizon, given by its asymptotic 
cosmological value ¢o. Thus, the vacuum field equation (5.35a) for the metric is Einstein’s 
vacuum field equation, and the solution is the Schwarzschild solution. The scalar field has 
no effect other than to determine the value of the gravitational constant Giocai. (This result 
also holds for rotating and charged black holes.) 

In Brans-Dicke theory for instance, the most direct way to verify this (Hawking, 1972) 
is to use the vacuum field equation for y = ¢ — ¢o, Oy = 0, and to integrate the quantity 
yy over the exterior of the horizon between two spacelike hypersurfaces at different 
values of coordinate time. After an integration by parts, we obtain 


| eae*v=ea'x - >| (¢°)* dq =0. (10.19) 


Now the surface integrals over the spacelike hypersurfaces cancel because the situation is 
stationary. The integral over the hypersurface at infinity vanishes because y ~ r~! asymp- 
totically. The integral over the horizon vanishes because d¥iq is parallel to the generators 
of the horizon and is thus in a direction generated by the symmetry transformations of the 
black hole (Killing direction) whereas (y”)_. vanishes along that symmetry direction. Thus 


/ Gap? /—gd'x = 0. (10.20) 


But Y.. is spacelike since ¢ is stationary, so Yay > 0 everywhere. Equation (10.20) 
thus implies that y.. = 0. Sotiriou and Faraoni (2012) extended Hawking’s theorem to the 
class of f(R) theories that can be transformed into generalized scalar-tensor theories. 

These theorems can be evaded, for example, if the scalar field is time dependent (e.g., 
because of cosmic evolution) or complex, or if it is massive and the black hole is rotating 
(see Berti et al. (2015) for a review). 


Vector-tensor theories 


Black-hole solutions have been studied extensively in Einstein-4ther theory and Horava 
gravity (Eling and Jacobson, 2006; Tamaki and Miyamoto, 2008; Barausse et al., 2011; 
Barausse and Sotiriou, 2013a). Barausse et al. (2011) obtained numerical black hole 
solutions for a range of values of the two parameters cy and c_ between zero and unity 
(the equations are so complicated that no analytic solutions were found). They evaluated 
such observable or invariant quantities as the angular velocity of a particle at the innermost 
stable circular orbit (ISCO), the angular velocity of the circular photon orbit, the maximum 
redshift of a photon emitted by a particle on the ISCO, and the proper circumference of the 
outer event horizon in units of the black hole mass. For cy; = c_ = 0, the solutions 
were numerically equivalent to the Schwarzschild solution, as expected. These invariant 
quantities were insensitive to the value of c_, and depended fairly weakly on c_, deviating 


216 


Structure and Motion of Compact Objects 


from the general relativistic values by between five and 15 percent for c, ~ 0.6, reaching 
deviations of 50 percent for c, ~ 0.9. In Hofava gravity, the deviations from general rela- 
tivity were consistently smaller in most of the parameter space. Slowly rotating black hole 
solutions have also been analyzed (Barausse and Sotiriou, 2013b; Barausse et al., 2016). 


Tensor-vector-scalar theories 


Black-hole solutions in TeVeS were studied by Giannios (2005) and Lasky (2009). For a 
black hole at rest in the preferred frame, and for the restricted case in which the vector 
field has only a time component, the physical metric turns out to be identical to the 
Schwarzschild metric; in the unrestricted case (K, 4 0) only the asymptotic properties 
of the metric and fields were studied. 


Quadratic gravity theories 


Non-rotating black hole solutions in a range of quadratic gravity theories were studied with 
a view toward possible observational signatures by Pani and Cardoso (2009), Yunes and 
Stein (2011) and Kleihaus et al. (2011); for a brief review, see Blazquez-Salcedo et al. 
(2017). In Chern-Simons theory, spherically symmetric black holes are identical to their 
general relativistic counterparts, but rotating black holes are not. Slowly rotating black 
hole models were analysed by Yunes and Pretorius (2009), Konno et al. (2009) and Yagi 
et al. (2012). 


10.3 The Motion of Compact Objects 


In Chapter 6, we derived the N-body equations of motion for massive, self-gravitating 
bodies within the PPN framework [see Eqs. (6.45) and (6.46)]. A key assumption that 
went into that analysis was that the weak-field, slow-motion limit of gravitational theory 
applied everywhere, in the interiors of the bodies as well as between them. This assumption 
restricted the applicability of the equations of motion to systems such as the solar system. 
However, when dealing with systems containing a neutron star or a black hole with highly 
relativistic spacetimes near or inside them, one can no longer apply the assumptions of 
the post-Newtonian limit everywhere, except possibly in the interbody region between 
the relativistic bodies. Instead, one must employ a method for deriving equations of 
motion for compact objects that, within a chosen theory of gravity, involves (a) solving 
the full, relativistic equations for the regions inside and near each body, (b) solving the 
post-Newtonian equations in the interbody region, and (c) matching these solutions in an 
appropriate way in a “matching region” surrounding each body. This matching presumably 
leads to constraints on the motions of the bodies (as characterized by suitably defined 
centers of mass); these constraints would be the sought-after equations of motion. Such 
a procedure amounts to a generalization of the Einstein-Infeld-Hoffmann (EIH) approach 
(Einstein et al., 1938). 
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Let us first ask what would be expected from such an approach within general relativity. 
In the fully weak-field post-Newtonian limit, we found that the motion of post-Newtonian 
bodies is independent of their internal structure, that is, there is no Nordtvedt effect. Each 
body moves on a geodesic of the post-Newtonian interbody metric generated by the other 
bodies, with proper allowance for post-Newtonian terms contributed by its own interbody 
field. This is the ETH result. 

It turns out however, that this structure independence seems to be valid even when the 
bodies are highly compact (neutron stars or black holes). The only restriction is that they 
be quasistatic, nearly spherical, and sufficiently small compared to their separations that 
tidal interactions may be neglected. This would be a bad approximation for a neutron star 
about to spiral into a stellar-mass black hole, for example, but is a good approximation for 
binary pulsars or for the inspiral of a binary system of compact bodies before the last few 
orbits. 

Although this conclusion has not been proven rigorously, a strong argument for its plau- 
sibility can be presented by considering in more detail the matching procedure discussed 
earlier. We first note that the solution for the relativistic structure and gravitational field of 
each body is independent of the interbody gravitational field, since we can always choose 
a coordinate system for each body that is freely falling and approximately Minkowskian 
in the matching region and in which the body is at rest. Thus, there is no way for the 
external fields to influence the body or its field, provided that we can neglect tidal effects 
due to inhomogeneities of the interbody field across the interior of the matching region. 
Only the velocity and acceleration of the body are affected. Now, if the body is static and 
spherically symmetric to sufficient accuracy, its external gravitational field is characterized 
only by its Kepler-measured mass m and is independent of its internal structure; the field 
is given essentially by the Schwarzschild metric. Thus, the matching procedure described 
above must yield the same result, whether the body is a black hole of mass m or a post- 
Newtonian body of mass m. In the latter case, the result is the ETH equations of motion 
(see Section 6.2), so those equations must be valid in all cases. A slightly different way 
to see this is to note that because the local field of the body in the freely falling frame 
is spherically symmetric, depends only on the constant mass m, and is unaffected by the 
external geometry, the acceleration of the body in the freely falling frame must vanish, so 
its trajectory must be a geodesic of some metric. The metric to be used is a post-Newtonian 
interbody metric that includes post-Newtonian terms contributed by the body itself, but 
that excludes self-fields. This was shown for nonrotating black holes in a seminal paper by 
D’Eath (1975), and was extended to a variety of contexts by Rudolph and Borner (1978a,b), 
Kates (1980), Thorne and Hartle (1985), Anderson (1987), Itoh et al. (2000), and Taylor 
and Poisson (2008). This structure independence was also verified explicitly to 2PN order 
for self-gravitating fluid bodies by Mitchell and Will (2007). For a pedagogical treatment 
of this approach at 1PN order, see PW, section 9.4. 

A key element of these derivations is the validity of the Strong Equivalence Principle 
within general relativity (see Chapter 3 for discussion), which guarantees that the structure 
of each body is independent of the surrounding gravitational environment. By contrast, 
most alternative theories of gravity possess additional gravitational fields, whose values in 
the matching region can influence the structure of each body, and thereby can affect its 
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motion. Consider as a simple example a theory with an additional scalar field (scalar- 
tensor theory). In the local freely falling coordinates, although the interbody metric is 
Minkowskian up to tidal terms, the scalar field has a value ¢o(¢). In a solution for the 
structure of the body, this boundary value of ¢o(t) could influence the resulting mass, for 
example, by controlling the locally measured value of G, thus turning the mass into a 
function m(¢o) of the scalar field. Thus, the asymptotic metric of the body in the matching 
region may depend upon its internal structure via the dependence of m on ¢o (essentially, 
the matching conditions will depend upon m, dm/d@, ...). Furthermore, the acceleration 
of the body in the freely falling frame need not be zero, as we saw in Sections 2.4 and 3.3. 
If the mass-energy of a body varies as a result of a variation in an external parameter, we 
found, using cyclic gedanken experiments that assumed only conservation of energy, that 
(see Eqs. (3.97) and (3.99)) 

Olnm 
do 
Thus, the bodies need not follow geodesics of any metric, rather their motion may depend 

on their internal structure. 

In practice, the matching procedure described above is very cumbersome (D’Eath, 
1975). Within general relativity, a simpler method for obtaining the EIH equations of 
motion is to treat each body as a “point” mass of inertial mass m, and to solve Einstein’s 
equations using a point-mass matter action or energy-momentum tensor, with proper care 
to neglect or regularize infinite “self” fields. In the action for general relativity, we thus 
write 


1 
a — Ageodesic ™ mV EBX V) ead Vo. (10.21) 


1 
I= ag | BV=ea's Ym f dr, (10.22) 


where 7, is proper time along the world line of the ath body. By solving the field equations 
to 1PN order, it is then possible to derive straightforwardly from the matter action an N- 
body EIH action in the form 


Ter = fee sei udde (10.23) 


with a Lagrangian L written purely in terms of the variables (x,, va) of the bodies. The 
result is Eq. (6.81), with the PPN parameters set to their general relativistic values. The 
N-body EIH equations of motion are then given by 


ieee a=1,...N. (10.24) 

In alternative theories of gravity, we assume that the only difference is the pos- 
sible dependence of the mass on the boundary values of the auxiliary fields. In the 
quasi-Newtonian limit (Sections 2.4 and 3.3), this was sufficient to yield the complete 
quasi-Newtonian acceleration of composite bodies including modifications (Nordtvedt 
effect) due to their internal structure. Thus, following the suggestion of Eardley (1975), we 
merely replace the constant inertial mass m, in the matter action with the variable inertial 
mass m,(wW4), where 14 represents the values of the external auxiliary fields, evaluated at 
the body (we neglect their variation across the interior of the matching region), with infinite 
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self-field contributions excluded. The functional dependence of m, upon the variable wy, 
will depend on the nature and structure of the body. Thus, we write the action of the 
alternative theory in the form 


I=Ig-) > / mq (Wa[Xa(Ta)]) A705 (10.25) 


where /g is the action for the metric and auxiliary fields 44. In varying the action with 
respect to the fields g,,, and wy, the variation of m, must now be taken into account. In 
the post-Newtonian limit, where the fields 44 are expanded about asymptotic values po) 


according to wy = o) + dwy, it is generally sufficient to expand m,(w4) in the form 
Oma 1 Om, 
ma(Wa) = mab?) +) —“ bua t= >> SvdUet.-.. (10.26) 
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Thus, the final form of the metric and of the N-body Lagrangian will depend on mg = 
Ma (' ie and on the parameters 0m, / ay, and so on. We will use the term “sensitivities” 
to describe these parameters, since they measure the sensitivity of the inertial mass to 
changes in the fields 74. Thus, we define 


(4) _ Olnm, 
“any? 
& Inmg 
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and so on, where the derivatives are typically taken holding the total baryon number of the 
body fixed, for bodies made of matter. For black holes, some other method must be used 
to identify the sensitivities. Similar sensitivities can be defined for the radius and moment 
of inertia of the compact body. The final result is a “modified EIH formalism.” 

Gralla (2010, 2013) developed a more general theory of the motion of “small” bodies 
characterized by such parameters as mass, spin and charge in an environment of external 
fields, and argued that Eardley’s ansatz is a special case of that general framework. 


10.3.1 A modified ElH formalism 


By analogy with the ppNn formalism, a general EIH formalism can be constructed using 
arbitrary parameters whose values depend both on the theory under study and on the 
nature of the bodies in the system. However, to keep the resulting formalism simple, we 
will make some restrictions. First, we restrict attention to semiconservative theories of 
gravity; every Lagrangian-based metric theory of gravity falls into this class. We will also 
restrict attention to theories of gravity that have no Whitehead term in the post-Newtonian 
limit; all of the currently viable theories described in Chapter 5 satisfy this constraint. 
This generalizes the formalism presented in the first edition of this book, which made a 
restriction to fully conservative theories. Nordtvedt (1985) discussed a similar formalism 
for compact-body dynamics. 
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Each body is characterized by an inertial mass m,, defined to be the quantity that appears 
in the conservation laws for energy and momentum that emerge from the EIH Lagrangian. 
We then write for the metric, valid in the interbody region and far from the system, 


g “Me 2 
=-1+2 ) —— +O ; 
£00 + : Qy as + (e ) 


&0j = O(e*/) ’ 


~ Ma 
ge = ( +2>°% ca) Se + O(e?), (10.28) 


where a% and y7 are functions of the parameters of the theory and of the structure of the 
ath body. For test-body geodesics in this metric, the quantities a*m,, and }*, aZmz, are the 
Kepler-measured active gravitational masses of the individual bodies and of the system as 
a whole. In general relativity, a = 77 = 1. 

To obtain the modified EIH Lagrangian, we first generalize the post-Newtonian semi- 
conservative N-body Lagrangian, Eq. (6.81), in a natural way, to obtain 


ie. ; 
Lem = — >) mq E aVa— 5(1+4,)y4| 


a 


1 MaMb 1 
€ 2 dX dX Yab gu + 3B ase 2 (Gar + 6Biar) + Cab) Ya * 


1 
_,) (Gab + Eab) (Yq ; Nap) (V5 : na) 


1 a 
— 2 .s > > Dabe a. 


a b#acH#a 


Me 


(10.29) 
Tac 
where Mg, = Xap /Tab. Notice that we did not introduce a parameter in front of the v2 term 
in Eq. (10.29). Any such parameter can always be absorbed into a new definition of the 
inertial mass m/, of body a. We are then free to change the constant term — }°, m, to be the 
sum of the new inertial masses — }°,, mj. This has no effect on the equations of motion, 
but does allow the Hamiltonian derived from Lyyy to be the sum of the new inertial masses 
at lowest order. We also did not include a term of the form (mgmp/rap)(Vp - Nap); Such a 
term can be associated (via a total time derivative in the Lagrangian) with the Whitehead 
term in Eq. (6.81), which we have chosen to reject. 

The quantities Az, Gay, Bas, Cab, Eb and Daye are functions of the parameters of the 
theory and of the structure of each body, and satisfy 


Gab = Gab) > Cab = Cab), Eab = E(ab)> Dabe = Da(be) : (10.30) 


Note that B,, has no special symmetry, in general. 

In general relativity, G,, = Bay = Dane = 1, while A, = Cay = E,» = 0. In the post- 
Newtonian limit of semiconservative theories (with € = 0), for structureless masses (no 
self-gravity), the parameters have the values [compare Eq. (6.81)] 


1 
C= 1, Ba = 3(27 +1), Dave = 28 —1, 


Aa =0, Cy=a,—a ’ Eah = Q2. (10.31) 
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In the fully conservative case, including contributions of the self-gravitational binding 
energies of the bodies, the parameters have the values 


a, 2 
Gap = 1+ (48-7 3)( | s) 
Mq Mp 
1 
Bas = 3(27 +1), Dabe = 28-1, 
Ag =Ca = C4 = 0; (10.32) 


where (2, is the self-gravitational energy of the ath body. 

In a theory of gravity that singles out a preferred frame, such as the Einstein-ther 
theory, the Lagrangian (10.29) must be understood to be defined in that frame. To obtain 
the Lagrangian in a moving frame, we must carry out a post-Galilean transformation 
of the Lagrangian (see Section 4.3 for discussion). We make a low-velocity Lorentz 
transformation (post-Galilean transformation) from (t, x) to (7,€) coordinates, given by 


x=€+(1450*)}wr + 5 (€-w)w+é x O(e), 


1 3 1 
tar(14 pw tou) + (1450) e: w+7 x O(e). (10.33) 
From the transformation (10.33), we have 
dg 1 1 
Vy =VUagtw- = (E,W) — Wa — (Ya) (6 +5") F 
1 1 


Ea [1 + Sloan)? + (Oe: E,)(va- nis) — (0 &5)(U0 My) | (10.34) 


Yab ~ ab 


where €,, €, and v, = d&,/dr are to be evaluated at the same time, given by a clock at 
the spatial origin (€ = 0) of the moving coordinate system, and n’, = €,,/€a». The new 
Lagrangian is given by 


L(E,7) = L(x, t)—— — —- (10.35) 


where dt/dr is evaluated at € = 0, and where we are free to subtract a total time derivative 
of a function /to simplify the new Lagrangian. Substituting these results into Eqs. (10.29) 
and (10.35), dropping constants and total time derivatives, and replacing €,, and v, with 
rap and v,, we obtain the Lagrangian in the moving frame 


1 
L=Lypyt A dimes [v2w? +2v2(vq- Ww) +2(v4- w)’] 


> | 
Yab 


Capw —2 (6Biab) — Ca) (Va 7 w) 


ert 
+Eap [(w + Man)” +2(W + May) (Va Nav) |} (10.36) 
Notice that the Lagrangian is post-Galilean invariant if and only if 
Ag = Bias} = Cap = Exp = 0. (10.37) 


These quantities are the preferred-frame parameters of our modified EIH formalism. 
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Since our ultimate goal is to apply this formalism to binary systems containing compact 
objects, let us now restrict attention to two-body systems. We obtain from L the two-body 
equations of motion 


M7n 


mate — (3G12Bi2 + Diaz) 


my 
aQq= —— 
r 


[29 fp + 69i2Bira) + 2Pau + Gr2(Cia + Ex2)] = 
: 3Bi2 — Gia + Ar) vj - 5(3B +Gi2+ E12)V3 
3 (6Boi2 + 2Gi2 + Ci2 + E12) 1 - v2 — ; (Gi2 + E12) (n+ v2)? 
45 (C12 + Gi2Ai) Ww? + ; (Ci2 — 6Brray + Ex2 + 2G12.A1) v1 


1 3 
+ 5 (Cio + 6Biiay — E12) v2 -w + 5Ei2[(w- my? + 2(w- n)(v2- n))} 


ais 2 n- {[3Bi2 + Gi2(1 + Ai)] v1 — 3Bizv2 + Gi2Aiw} 


l mv 
= corn -{(6Bii2) + 2G12 + Ci2 + E12) v1 


_ (6B) + Ci. — E\2) v2 + 2E1.w} 
— <——n- { (Cio — 6Byzy + E12 — 2G12A1) v1 — (Ci2 — 6Byz] — E12) v2 
—2(Gi2A1 — Ey2) w} , 
ay = {1 =2;n- —n}, (10.38) 


where a, = dv,/dt, x = x; —X2, r = |x|, and n = x/r. If the conditions of Eq. (10.37) 
hold, then the preferred-frame terms vanish. 

To Newtonian order, the center of mass X = (mx, + m2x2)/m of the system is 
unaccelerated, thus, to sufficient accuracy in the post-Newtonian terms, we can set X¥ = 
X = 0 and write 


epee [= i ote) |x. (10.39) 


As in Section 6.4.1, we define 


v= Vy v2, a=ay\ a2, 


m=m+m, n= a A = ——_.. (10.40) 
We also define the coefficients 


m m 
G=Ce, BLa=Byy3, B-=S8p), P= — Din a = Das 


C=Cs,. f= eu, AM = (™)" A, (-=)' a2. (10.41) 
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Then the equation of motion for the relative orbit takes the form 
a=a,+ apr, (10.42) 
where the purely two-body, or “local” contributions have the form (we use “hats” to denote 
parameters associated with compact bodies) 
i oe i S [2 (4. 4 Api? +43") 4 ivB, | (10.43) 
where 


a =5 {901 6n) — 3By —3AB_ n(C +2€) +94] , 


ee 2. 9 
As = G [2nG + 3B, +n(C +€)+3AB_]+D, 
By = G(1 — 2n) +3B, +3AB_+9C+G4%, (10.44) 


and the preferred-frame contributions have the form 
1 3 
app = zi n| ( ~-d + 2gGA®) (wey) += (Ge ae gA\) (w- n) 
r 2 2 
1 
—w Fate -v) + do(n- »)| + GA?) v(n »)| ; (10.45) 
where 
&, = A(C+€) -6B_ —2GA, 
dy =E-GA, (10.46) 
These two parameters play the role of compact-body analogues of the PPN parameters a1 
and qa in the context of binary systems. 
In the Newtonian limit of the orbital motion, we have a = Gmx/r>, with Keplerian 
orbit solutions given by Eq. (6.73), but with m replaced by Gm. The post-Newtonian terms 


in Eq. (10.43) can be inserted into the Lagrange planetary equations (6.74). Using the 
methods outlined in Section 6.4.2, we find that the pericenter advance per orbit is given by 


Aw = 2™ pg-), (10.47) 
Pp 
where p = a(1 — e”), and where 
1 1 
P =GB, + <(G°-D)+ 2G [sas +n +e) + GA] (10.48) 


This is the only secular perturbation produced by the post-Newtonian terms in Eq. (10.43). 
In the PPN limit, this result agrees with Eq. (6.78) for semiconservative theories. 

We can convert Aw to a rate by dividing by the orbital period and can use Ph = 
2r(a?/Gm)'/?, to eliminate a to obtain 


. 67 27m a 4/3 
(w) = Pde) ( P ) PG : (10.49) 


This expression will be useful for discussing binary pulsar measurements. 
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We can also use the preferred-frame contributions to the equations of motion apr to 
work out the secular perturbations of the orbit elements of a binary system, extending 
the analysis of Section 8.2 to systems with compact bodies. The results are identical to 
Eggs. (8.2), (8.33), (8.35), and (8.37), with € = 0, and with a, and @) replacing a; and a, 
respectively. 

Returning to the full N-body Lagrangian (10.29), and working at quasi-Newtonian order, 
we can derive the leading effects in a hierarchical three-body system, such as the pulsar 
triple JO337+1715, including the Nordtvedt effect. The resulting equation of motion for the 
inner binary is given by Eggs. (8.10)-(8.12), with 


m* = Giom, 
1 
a=>5 (Giz + Gos) , 
ae eee (10.50) 


As an illustration of this modified EIH framework for compact bodies, we will again 
focus on specific theories where calculations have been carried out. As we have already 
discussed, a variety of approaches have shown that the EIH equations of motion for 
compact objects within general relativity are identical to those of the post-Newtonian limit 
with weak fields everywhere. In other words, in general relativity, Ga, = Bap = Pave = 1, 
and the remaining coefficients vanish, independently of the nature of the bodies. 


10.3.2 Scalar-tensor theories 


The modified EIH formalism was first developed by Eardley (1975) for application to 
Brans-Dicke theory. It makes use of the fact that only the scalar fleld ¢ produces an external 
influence on the structure of each compact body via its boundary values in the matching 
region. This boundary value of ¢ is related to the local value of the gravitational constant 


as felt by the compact body by 
G (44+ 2w 
Ghocal = — ; 10.51 
we aos 


where G is the fundamental gravitational coupling constant. Thus we will treat the inertial 
mass m, of each body as a being a function of ¢. Then, by defining the deviation of ¢ 
from its asymptotic value ¢o as in Section 5.3.2, 6 = ¢o(1 + WV), we can write down the 
expansion 


1 
ma(o) = Ma [ + sq + —(s? +s), — 54)? + ow) F (10.52) 


2 


where m, = m,(¢o), and we define the dimensionless sensitivities 


_ {dilnm,(¢) 
a ( ding , ; 
a & Tie), 
= ( rire he (10.53) 
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The action for the theory is then written 


1 wh) ap 
= EG / [or - ee $b. sf 22d f= dX i ma()dTa » (10.54) 
where the integrals over proper time 7, are to be taken along the world line of each body a. 
We have dropped the scalar self-interaction potential U(¢). It is straightforward to vary the 
action with respect to g,,, and ¢ to obtain the field equations, 


a ed Ce 
+2 (yr SC) (10.55a) 
= Tw (sxor- I6nGo5" - ae 10") (10.55b) 
where 
ea Lime Juliut (uf) '8(x — Xa), (10.56) 


where ul’ is the four-velocity of body a. The equations of motion take the form 


T™”.,, ——¢” =0. (10.57) 


In harmonic gauge, the relaxed field equations for he? are given by Eqs. (5.43), while for 
the scalar field, the relaxed field equation is given by Eqs. (5.45) and (5.46), but with the 
replacement 


a3 , 
Ext aG oar). (10.58) 


where T = T?” gy. 
Repeating the post-Newtonian calculation using the relaxed field equations, as described 
in Section 5.3.2, we obtain, to lowest order 


a, (1 — 255) + Of), 


m 
goo = aE 2 (1-205) +0(€), 


gy = —4(1 oy mh + O(e5/2), 


_ oe 
Mp 
gi = On : 125 24260) (10.59) 
p 78 
where ¢ = 1/(4+ 2wo)s 1 = |x — x,|, and we have chosen units in which Gioca) = 1. 


For the explicit O(«7) ene in goo and W, see Mirshekari and Will (2013). Notice that the 
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active gravitational mass as measured by test-body Keplerian orbits far from each body is 
given by 

(ma)a = Mq(1 — 2¢5q). (10.60) 
To compare with the post-Newtonian limit, we must connect the sensitivity s, to the 
self-gravitational energy 22,, defined in Box 6.2. The gravitational energy is actually 
proportional to Giocq). Thus if the total inertial mass of a weak-field, gravitationally bound 


body is m = mon +7 + Ghoca?+ Fin, Where 7 and Fi,; are the kinetic and internal energies, 
respectively, then Q. = dm/dGioca. We can then express the sensitivity s for a given body 


in the form 
— (dinma($)\ (dn Giocat 
*¢~ \dinGiocal Jp \ dnd )y’ 


Qa 
= (1-2))—, (10.61) 
Ma 


where we used Eq. (10.51), including the fact that w also depends on ¢, and used the 
definition (see Eq. (5.55)), 


ows 
A= : 10.62 
(3 + 2u)(4 + Qu) C062) 
The active mass in the weak field limit is then given by 
Qa 
(ma)a =mg|1+2¢(1+2dA)—], (10.63) 
Mg 


in agreement with Eq. (6.59). 
From the complete post-Newtonian solution for g,,, and WY, we can obtain the matter 
action for the ath body, given by 


ee / ma($) (—go0 — 2gojvd — gaviv')'”? de. (10.64) 


To obtain an N-body action in the form of Eq. (10.23), we first make the gravitational terms 
in /,, manifestly symmetric under interchange of all pairs of particles, then take one of each 
such term generated in /,, and sum the result over a. The resulting N-body Lagrangian then 
has the form of Eq. (10.29) with 


Gab =1 2¢ (Sq + Sp 2saSb) ey 
1 
Bap = 3 [Gap + 2(1 > ¢)] ’ 


Dabe = GabGac + 2¢(1 - 2sp)(1 — 25¢) [A(1 _ 28a) + 2¢s1] ; 
Aa = Bias) = Cab = Ear = 9. (10.65) 


The quasi-Newtonian equations of motion obtained from the EIH Lagrangian are 


MpXab 
ao 3 [1 — 2¢ (sa + 85 — 2sasp)] « (10.66) 
bAa a 
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For bodies with weak internal fields the product term s,s, may be neglected, and the 
acceleration may be written 


ig (10.67) 
te Ta, TD 


where (ma )p is given by Eq. (10.60) and where 
(mp)a = ma(1 — 2¢sa), (10.68) 


in agreement with our results of Section 6.3, Eq. (6.59). However, if the bodies are 
sufficiently compact that s, ~ s, ~ 1, then because of the product term s,5p, it is 
impossible to describe the quasi-Newtonian equations simply in terms of active and passive 
masses of individual bodies. 

Mirshekari and Will (2013) extended this analysis to higher post-Newtonian orders, 
obtaining the equations of motion for compact binary systems through 2.5PN order. At 
1PN order, the equations have the form 


a 
Be = om [m (1 — Apn) — 7vBpy] , (10.69) 
where 
-\.2 3 2 _ a = \Ggm 
Apy = —(1+37+ 9) 4 ud +2(2+7n+74+64,+AB6_) a 
Bpy = 2(2— +7), (10.70) 
where the coefficients 7, 3,, and BL are defined by 
7 = —2G-'¢(1 — 2s,)(1 — 252), 
_ 1.- _ 
B4 = 2 (81 = Bp) > 
By, =G-C(1 — 2s,)? [A(1 — 251) + 265%] , 
Bx = G-*C(1 — 251)? [A(1 — 250) + 2055] . (10.71) 


Roughly speaking, the sensitivity s ~ [self-gravitational binding energy]/[mass], so 
So ~ 10719, So~ 10~°, swhite aware ~ 107+. For neutron stars, whose equation of state is 
of the form p = p(p), a model is uniquely determined (for a given function w(@)) by the 
local value of ¢o (or Giocat) and by the central density p,, in other words m = m(G, p,) 
and N = N(G, p.), where N is the total baryon number. The sensitivity can be expressed in 
terms of G as 


ee ie 20) (Fe) (10.72) 
N 


However, in any theory of gravity in which the local gravitational structure depends upon 
a single external parameter whose role is that of a gravitational “constant,” it can be 


shown that 
Olnm Olnm Olnm OlnN 
= . 10. 
(aa). (gue) gan), onc) en 
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For fixed central density and fixed equation of state, a simple scaling argument reveals that 
mand N scale as G~3/, so that 


3 Olnm 
SNS = 5 (1 +2) [ - (Sen). | : (10.74) 


Note that (Om /ON)g is the injection energy per baryon. 

For the special case of Brans-Dicke theory, Salmona (1967) showed that neutron star 
models differed very little from their general relativistic counterparts, for values of w 
as small as 2.5. Using general relativistic neutron star models Eardley (1975) obtained 
sensitivities ranging from 0.07 to 0.39 for masses ranging from 0.5Me to 1.41M6 
(the maximum mass for the 1975-era equation of state used). Zaglauer (1992) obtained 
sensitivities for neutron stars with masses between about 1.1Mo and 1.46M. for the B, O 
and M equations of state tabulated by Arnett and Bowers (1977). The sensitivities ranged 
from 0.19 to 0.30 for the soft B equation of state, and from 0.08 to 0.12 for the stiff M 
equation of state. 

Similar results hold for general scalar-tensor theories with w( positive (corresponding 
to positive values of the Damour-Esposito-Farése parameter o). However, when wy is 
sufficiently negative (corresponding to Gy) < —4), spontaneous scalarization can occur, 
leading to neutron star models whose structure is very different from those in general 
relativity, with dramatically different values of the sensitivities (Damour and Esposito- 
Farése, 1993, 1996). In the DEF approach (see Section 5.3), the relevant parameter is 
a? ~ (1 — 2s)/(4 + 2w). For example, for a theory corresponding to a value fy = —6, 
with asymptotic conditions on the theory chosen to agree with the 1993 solar-system bound 
of wy > 1500, they found a? ~ 0.4, corresponding to s ~ —600! As we will see in 
Chapter 12, binary pulsar observations make it possible to place very tight bounds on this 
regime of scalar-tensor theories. 

For black holes, we found in Section 10.2 that the scalar field is constant in the exterior 
of the black hole. Thus from Eq. (10.59) we must have spy = 1/2. Another way to see 
this is to note that, because all information about the matter that formed the black hole 
has vanished behind the event horizon, the only scale on which the mass of the hole can 
depend is the Planck scale, mptanck = (ic /G)'/?, and thus mpy & mplanck x G7!/? x g!/2, 
and hence spy = 1/2. Substituting s, = s, = s- = 1/2 in Eqs. (10.65), we obtain 
Gab = Bay = 1 —C and Daye = (1 — C)?. Then it is straightforward to see that if we 
replace each mass in the Lagrangian (10.29) by m/(1 — ¢) = m(4 + 2w9)/(3 + 2w0o), 
then apart from a meaningless overall mutiplicative factor of (1 — ¢)~', the Lagrangian 
is identical to that for general relativity. Since black-hole masses cannot be measured by 
any means other than via their orbital dynamics, this implies that, at 1PN order, the motion 
of a system of black holes in scalar-tensor theory is indistinguishable from the motion in 
general relativity. In fact, Mirshekari and Will (2013) showed that this is valid through 
2.5PN order. 

For binary systems containing one black hole, the equations are again identical to those 
of general relativity through 1PN order. However, at 1.5PN order (where dipole radiation 
reaction begins) and beyond, the equations differ from those of general relativity by terms 
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that depend on the single parameter Q = ¢(1 — ¢)~!(1 — 2s,)*, where 5; is the sensitivity 
of the companion to the black hole (see Mirshekari and Will (2013) for details). 


10.3.3 Constrained vector-tensor theories 


Because the norm of the vector field in Einstein-Aither and Khronometric theories is 
constrained to be —1, the structure of a spherically symmetric compact body at rest with 
rest to the preferred rest frame does not depend on it. However it could depend on the time 
component K°, or more properly on the invariant quantity K - u, where u is the body’s 
four-velocity. (The structure of a rotating body could also depend on the projection of K 
along the body’s spin axis, but here we will focus on non-rotating bodies.) We define for a 
body with four-velocity u, 


y=—-K*u, =14+¥, (10.75) 
where we assume that far from the system, for a test body at rest, K- u = —1. We define 
the sensitivities 

_ (dinmal) 
Sa = | ’ 
@iny 7 1-4 
lnm 
f= (aE) (10.76) 
d(Iny) el 


where the derivatives are to be taken holding baryon number fixed. Then in terms of V, we 
can expand 


1 
m,(7) =m, tsa + 5 (s, +s! — sq) + o(w)| (10.77) 


With this assumption, Foster (2007) and Yagi et al. (2014a) derived the metric and 
equations of motion to post-Newtonian order for systems of compact bodies in Einstein- 
Ether theory. The metric is given by 


Zo = —1+2U—2U’ — 26, +36), 4+ O(e), 


go =g/ + Oe), 
ge = (1 + 2U) 6x + O(e’), (10.78) 


and the vector field is given by 
K° = (—go9) 7? + Of), 
Ki=kK+0(8”), (10.79) 


where 


Gymp G2mpm, Gymp 
v= > an ; 2=) “+, = >) 7 vi(1 — sp), 


rpyr, 
b he. eens b 


. G Mp _; F 
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; Gymp | 3d ; 
2. [crv + Cf mime -v»)] (10.80) 
b 


where Gy = 2G/(2 — cj4) and cj4 = c; + c4. The quantities B; and C, are complicated 
expressions involving the constants c), c2, c3, and ca of Einstein-Ather theory and the 
sensitivities s, (see Eqs. (23) and (29) of Foster (2007)). 

From Eggs. (10.78) and (10.79), we obtain 


i! 3 se 
U(x.) = a + rig + 2v2U(xq) — viki (xa) + O(e*). (10.81) 


Note that, when vj = 0, V = 0, resulting in no dependence of the inertial mass on the 
vector field. Writing the action for the ath body as 


— / m,(¥) (goo — 2gyvi — gavivt)'”? ar, (10.82) 


we expand to post-Newtonian order, make the action manifestly symmetric under inter- 
change of all pairs of particles, select one of each term, and sum over a. After rescaling 
each mass by m, — m,/(1 — s,) and replacing the constant term in the Lagrangian by 
the sum of the rescaled masses, we obtain the modified EIH Lagrangian in the form of 
Eq. (10.29), with 
Gy 

(1 — sa)(1 — sp)’ 

Bab = Gav(1 — Sa), 
Dabe = GabGac(1 — Sa); 


Gap = 


/ 


Sq 
Aa = Sq — ie Rs ? 
Cab = Gab [ay a2 4 3 (3 t Sp) Qa Rab| ey 
Cab = Gab [a2 a Qab = Rav] ’ (10.83) 


where 


1 2-—c¢ 2-—¢ 
Quan = ( = ) (a1 — 202)(Sa +55) + Fs + saSb » 
123 


2 \2c4 — C44 
8 
Rab = = [c_ (sa +55) + (1 — c_)sqs5] « (10.84) 
1 


Here a; and az are the PPN parameters of Einstein-Ather theory, given by Eqs. (5.63), 
c+ = c; £03, and cj23 = cy +c2 +c3. The two-body equations of motion that follow from 
the Lagrangian with these coefficients agree with Eq. (33) of Foster (2007) (after correcting 
a sign and a parenthesis in Eqs. (34) and (35) of that paper). 

Note that the Lagrangian is in general not Lorentz invariant, and therefore will exhibit 
preferred-frame effects. Even when the parameters c; are constrained so as to enforce a; = 
Q2 = 0, making the dynamics Lorentz invariant in the fully weak field post-Newtonian 
limit, the dynamics of compact bodies will still be dependent on the overall motion of the 
system via the motion-induced sensitivities of the bodies. 
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The corresponding equations in Khronometric theory can be obtained from these by 
setting c) = —€, cp) = Ax, c3 = Be +, and cy = ax + «, and taking the limit € + oo 
(see Yagi et al. (2014a)). The parameters of the modified EIH Lagrangian are given by 
Eq. (10.83), but now with Gy = 2G/(2 — ax), and 


Qab = [(ax at Bx a 3Ax) (Sa a Sb) ate (2 —— QK)SaSb] > 


1 
Bx t+ XK 
1 
Rab = a. + a1) [Sq +85 — SaSp] (10.85) 


where a is the PPN parameter of Khronometric theory, given by Eq. (5.68). 

Yagi et al. (2014a) calculated neutron star sensitivities in both Einstein-A:ther and 
Khronometric theories. In order to do so, it was necessary to construct models for neutron 
stars moving uniformly relative to the preferred frame. From Eqs. (10.77) and (10.81), 
assuming uniform motion with no external bodies (U = k/ = 0), the sensitivity is given by 
s=v'dlnm /dv. They chose the coefficients cz and c4 so that the PPN parameters a and 
@2 saturate the bounds from solar-system measurements, and obtained fitting formulae for 
the sensitivities as a function of c;, c_ and the compactness M/R of the neutron star. 


Gravitational Radiation 


In this chapter, we develop the tools needed to treat gravitational radiation in general 
relativity and alternative theories of gravity. This is the “dynamical” regime of relativistic 
gravity, and since gravitational waves are intimately linked to the motion of masses, we 
begin (Section 11.1) with a brief history of the “problem of motion and radiation” in 
general relativity. This history has been at times muddled, at times controversial. We 
follow this (Section 11.2) with a short account of the status of today’s gravitational- 
wave detectors. We then turn to the theory of gravitational waves in general relativity 
and alternative theories, treating their speed (Section 11.3), polarization (Section 11.4) and 
generation, mainly by binary systems (Section 11.5). 


11.1 The Problem of Motion and Radiation 
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At the most primitive level, the problem of motion in general relativity is relatively 
straightforward, and was an integral part of the theory as proposed by Einstein.! This 
is geodesic motion, the motion of a test particle in a fixed spacetime provided by some 
distribution of matter, for example, that of a spherically symmetric star. The motion of a 
test particle does not generate gravitational waves, of course. The first attempts to treat 
the motions of multiple bodies, each with a finite mass and thus each contributing to the 
spacetime geometry, were made by de Sitter (1916) and Droste (1917) (see also Lorentz 
and Droste (1917)). They derived the metric and equations of motion for a system of N 
bodies, in what today would be called the first post-Newtonian approximation of general 
relativity (de Sitter’s equations turned out to contain some important errors, however). 

Einstein (1916, 1918) studied gravitational radiation, using the linearized approximation 
of his theory to derive the energy emitted by a body such as a rotating rod or dumbbell, 
held together by nongravitational forces. The first paper was marred by rather egregious 
errors; the second paper was correct, apart from a trivial numerical error, later spotted and 
corrected by Eddington (1922). The underlying conclusion that dynamical systems would 
radiate gravitational waves was correct. 

However, there was ongoing confusion over whether gravitational waves are real or 
are artifacts of general covariance. Although Eddington (1922) clearly elucidated the 
difference between the physical, coordinate independent modes and modes that were 
purely coordinate artifacts, he made the unfortunate remark that some gravitational waves 


! This history is adapted from Will (2011). See also Damour (1987). 
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propagate “with the speed of thought.” He was referring to the coordinate modes, but the 
understanding of the role of coordinates at the time was sufficiently poor that the remark 
cast a pall of doubt over the entire subject. 

For example, in 1936, in a paper submitted to The Physical Review, Einstein and Rosen 
claimed to prove that gravitational waves could not exist. However, the anonymous referee 
of their paper found that they had made an error. He pointed out that the singularity in 
their solution that caused them offer their negative conclusion was actually a harmless 
coordinate singularity. Upset that the journal had sent his paper to a referee (a newly 
instituted practice), Einstein withdrew the paper and refused to publish in The Physical 
Review again. Einstein became duly convinced that the referee was right, however, and a 
corrected paper showing that gravitational waves do exist — cylindrical waves in this case — 
was published elsewhere (Einstein and Rosen, 1937). Fifty years later it was revealed that 
the anonymous referee had been H. P. Robertson (Kennefick, 2005). 

The next significant advance in the problem of motion came a few years later. In 1938, 
Einstein, Infeld, and Hoffman published the now legendary EIH paper, a calculation of the 
N-body equations of motion using only the vacuum field equations of general relativity 
(Einstein et al., 1938). They treated each body in the system as a spherically symmetric 
object whose nearby vacuum exterior geometry approximated that of the Schwarzschild 
metric of a static spherical star. They then solved the vacuum field equations for the metric 
between each body in the system in a weak field, slow-motion approximation, equivalent 
to the post-Newtonian approximation. Then, using a version of what today would be called 
“matched asymptotic expansions” they showed that, in order for the nearby metric of each 
body to match smoothly to the interbody metric at each order in the expansion, certain 
conditions on the motion of each body had to be met. This is essentially the matching 
procedure described in Chapter 10. Together, these conditions turned out to be equivalent 
to the Droste—Lorentz N-body equations of motion. The internal structure of each body was 
irrelevant, apart from the requirement that its nearby field be approximately spherically 
symmetric.” 

During the 1960s, Fock in the Soviet Union and Chandrasekhar in the United States 
independently developed and systematized the post-Newtonian approximation in a form 
that laid the foundation for modern post-Newtonian theory (Fock, 1964; Chandrasekhar, 
1965). They developed a full post-Newtonian hydrodynamics, with the ability to treat 
realistic, self-gravitating bodies of fluid, such as stars and planets. In a suitable limit 
of “point” particles, or bodies whose size is small enough compared to the interbody 
separations that finite-size effects such as spin and tidal interactions can be ignored, their 
equations of motion could be shown to be equivalent to the EIH and the Droste-Lorentz 
equations of motion.* 


i) 


At the end of their paper, EIH noted that the detailed calculations were so incredibly complex that they could 
not possibly be published, but instead had been deposited in the US Library of Congress. The remark seems 
somewhat quaint in light of today’s dependence on algebraic software to handle the millions of terms involved 


in deriving equations of motion to high post-Newtonian orders. 
The author carried out this calculation as a term paper in Kip Thorne’s general relativity course in 1969. 


Evidently the paper was a success, for Thorne invited the author to join his research group the following 
summer. 


w 


234 


Gravitational Radiation 


Meanwhile, Pirani, Bondi, and others had demonstrated rigorously that gravitational 
waves were physically real, that they propagate with the speed of light and that they 
carry energy. At the same time, Joseph Weber began to build detectors for gravitational 
waves, based on suspended massive cylinders of aluminum that would vibrate in a resonant 
manner if a gravitational wave of the appropriate frequency passed them. In 1969 and 
1970, Weber made the stunning announcement that he had detected gravitational waves, 
most likely coming from the central regions of the Milky Way. The strength of the signals 
and rate of the detections were orders of magnitude higher than could be explained by 
reasonable astrophysical sources, and numerous follow-up investigations by physicists 
who built their own detectors failed to confirm Weber’s claims. By 1980, a consensus had 
emerged that Weber had not detected gravitational waves. Despite this negative outcome, 
Weber’s work had the positive effect of putting gravitational wave theory and detection 
on the scientific map, and of bringing into the field distinguished experimentalists from 
other branches of physics, such as Vladimir Braginsky, William Fairbank, Heinz Billing, 
Edoardo Amaldi, Ronald Drever, and Rainer Weiss. These scientists helped change the 
field from one dominated by theorists to one characterized by a healthy interplay between 
theory and experiment. Some of them began to work on a new type of gravitational-wave 
detection, based on laser interferometry. 

The discovery of the binary pulsar in 1974 revealed an inconvenient truth about 
the “problem of motion,” however. As Jiirgen Ehlers and colleagues pointed out in an 
influential 1976 paper (Ehlers et al., 1976), the general relativistic theory of motion and 
radiation, as it was understood at the time, was full of holes large enough to drive trucks 
through. They pointed out that most treatments of the problem used “delta functions” 
as a way to approximate the bodies in the system as point masses. As a consequence, 
the “self-field”, the gravitational field of the body evaluated at its own location, becomes 
infinite. While this is not a major issue in Newtonian gravity or classical electrodynamics, 
the non-linear nature of general relativity requires that this infinite self-field contribute to 
gravity. In the past, such infinities had been simply swept under the rug. Similarly, because 
gravitational energy itself produces gravity, it thus acts as a source throughout spacetime. 
This means that, when calculating radiative fields, integrals of the multipole moments of 
the source that are so useful in treating electromagnetic radiation begin to diverge in the 
case of gravity. These divergent integrals had also been routinely swept under the rug. 
Ehlers et al. further pointed out that the true boundary condition for any problem involving 
radiation by an isolated system should be one of “no incoming radiation” from the past. 
Connecting this boundary condition with the routine use of retarded solutions of wave 
equations was not a trivial matter in general relativity. The response of the source to the 
emitted flux of radiation was usually calculated by assuming that the flux of energy and 
angular momentum was balanced by an equal loss of those quantities in the motion of the 
source. While this balance assumption made sense, Ehlers et al. argued that there was no 
actual proof that it was valid. Finally, they pointed out that there was no evidence that the 
post-Newtonian approximation, so central to the problem of motion, was a convergent or 
even asymptotic sequence. Nor had the approximation been carried out to high enough 
order to make credible error estimates. 
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During this time, some authors even argued that the “quadrupole formula” for the 
gravitational energy emitted by a system (see Section 11.5.2), while correct for a rotating 
dumbbell as calculated by Einstein in the linearized approximation, was actually wrong 
for a binary system moving under its own gravity. The discovery in 1979 that the rate 
of decay of the orbit of the binary pulsar was in agreement with the standard quadrupole 
formula made some of these arguments moot. Yet the question raised by Ehlers et al. was 
still relevant: Is the quadrupole formula for binary systems an actual prediction of general 
relativity? 

Motivated by the Ehlers et al. critique, numerous workers began to address the holes in 
the problem of motion, and by the late 1990s most of the criticisms had been answered, 
particularly those related to divergences. For a detailed history of the ups and downs of the 
subject of motion and gravitational waves, see Kennefick (2007). 

Additional motivation for this theoretical work came in the early 1990s from the 
experimentalists. By this time, proposals to construct large-scale laser interferometric 
gravitational wave observatories were well advanced, eventually culminating in the LIGO 
interferometers in the United States, and Virgo and GEO600 in Europe. A leading can- 
didate source of waves detectable by these devices is the inspiral, driven by gravitational 
radiation damping, of a binary system of compact objects (neutron stars or black holes) 
(for a review of sources of gravitational waves, see Sathyaprakash and Schutz (2009)). 
It was realized that the analysis of signals from such systems would require theoretical 
predictions from general relativity that are extremely accurate, well beyond the leading- 
order prediction of Newtonian or even post-Newtonian gravity for the orbits, and well 
beyond the leading-order quadrupole formula for gravitational waves (Cutler et al., 1993). 

This presented a substantial theoretical challenge: to calculate the motion and radiation 
of systems of compact objects to very high post-Newtonian order, a formidable algebraic 
task, while addressing the issues of principle raised by Ehlers et al. sufficiently well to 
ensure that the results were physically meaningful. This challenge was largely met, and a 
post-Newtonian description of the inspiral orbit and gravitational wave signal is now in 
hand, approaching the fourth post-Newtonian (4PN) order beyond the approximation used 
to account for the binary pulsar (Blanchet, 2014). 

Meanwhile another major approach to the problem of motion, known as numerical 
relativity, finally began to bear fruit. This is the method whereby one solves Einstein’s 
equations for complex problems, such as the evolution of the orbit of two compact bodies 
and the gravitational waves emitted, by purely numerical means. The effort began with 
the pioneering work by Hahn and Lindquist (1964) and Smarr et al. (1976, 1977), focused 
on the head-on collision of two black holes. Subsequent progress was slow, however, as 
researchers struggled to find formulations of Einstein’s equations that were better suited 
to stable numerical analysis, to learn how to treat the coordinate freedom inherent in the 
theory, and to find schemes to deal with the event horizons of the black holes. Progress was 
also limited by the available computer speed, power and memory. The first full numerical 
solution for the head on collision of two nonspinning black holes was achieved in 1993 
(Anninos et al., 1993; Matzner et al., 1995). The more complicated problem of two orbiting 
and inspiralling black holes proved very challenging, until major breakthroughs occurred 
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in 2005, paving the way for rapid progress (Pretorius, 2005; Baker et al., 2006; Campanelli 
et al., 2006). Baumgarte and Shapiro (2010) provide a pedagogical review of the field. 

Today it is common to see published gravitational waveforms from black-hole inspirals 
and mergers that cover many orbits, that include a range of mass ratios, that incorporate 
black-hole spins together with the complex precessions of both the spins and the orbital 
angular momentum, and that give accurate results for the merger of the two black holes 
and the quasi-normal mode “ringdown” of the final black hole. Remarkably, there is a 
substantial region of overlap where post-Newtonian theory and numerical relativity are in 
“unreasonably” good agreement. This has permitted the development of phenomenological 
approaches that yield accurate analytic formulae that match post-Newtonian theory at early 
stages in the inspiral, and numerical relativity at the final stage. One of the most prominent 
of these is the “effective one-body” (EOB) approach, pioneered by Buonanno and Damour 
(1999). These formulae played a crucial role in the recent detections of gravitational waves 
by LIGO and Virgo (Abbott et al., 2016b, 2016c, 2017b, 2017c). 

At the same time, there has been significant progress in applying numerical relativity to 
systems containing matter, such as neutron stars. Here one must deal with the uncertainties 
in the equation of state of nuclear matter, the effects of shock waves and of magnetic 
fields, and microphysical processes such as radiation and neutrino transport (see Shibata 
and Taniguchi (2011) and Faber and Rasio (2012) for reviews). These advances became 
particularly relevant with the 2017 detection of a binary neutron star inspiral and merger by 
LIGO and Virgo, together with observations of the accompanying electromagnetic signals 
by numerous observatories on the ground and in space (Abbott et al., 2017d, 2017e). 

Needless to say, the “problem of motion” in general relativity looks quite different today 
than it did in 1974! 


11.2 Gravitational Wave Detectors 


The attempts to detect gravitational waves, from Weber’s pioneering yet flawed exper- 
iments to the present international effort involving laser interferometry, pulsar timing, 
and cosmic microwave background observations, is a story rich in sociology, history, 
technological development, and big-science politics. That story, and the details of the 
methods of gravitational-wave detection are beyond the scope of this book. Readers should 
consult the books by Saulson (1994), Maggiore (2007), and Creighton and Anderson 
(2011) for those details. 

The resonant bar detectors used by Weber and others have been almost completely 
phased out, in favor of kilometer-scale laser interferometer observatories. The US Laser 
Interferometer Gravitational-Wave Observatory (LIGO) was constructed between 1994 
and 1999, operated between 2002 and 2010 without making any detections, and then 
after a major upgrade to “advanced LIGO” with higher sensitivity, resumed operations in 
September 2015. LIGO consists of two interferometers, one in Hanford, Washington, and 
the other 3000 km away in Livingston, Louisiana, each with arms four kilometers long. The 
Virgo observatory, operated by a European consortium is in Cascina, Italy, near Pisa, with 
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three-kilometer long arms. It operated between 2003 and 2011, was upgraded, and resumed 
operation as advanced Virgo in August 2017. A smaller interferometer, GEO600, located 
in Hannover, Germany, is operated by a German-UK consortium; because its 600-meter 
arms mean lower sensitivity, GEO600 is today used mainly as a test-bed for advanced 
technologies for future detectors, for specialized searches for gravitational waves, and 
as a backup when the other observatories are off-line for maintenance or upgrading. 
The Kamioka Gravitational-wave detector (KAGRA), an underground three-kilometer 
interferometer in Japan, is nearing completion. Plans are well advanced for LIGO-India, a 
four-kilometer interferometer to be located in Maharashtra state, using interferometer and 
laser instrumentation that was built, but not used, in the final advanced LIGO observatory 
in Hanford. LIGO-India may join the other interferometers around 2024. These ground- 
based detectors are sensitive to gravitational waves in the frequency band between about 
10 Hz, where seismic and gravity-gradient noise dominate the detector response, and about 
500 Hz, where photon-counting or “shot” noise dominates. 

In addition, plans are being developed for the Laser Interferometer Space Antenna 
(LISA). Consisting of three spacecraft orbiting the Sun in a triangular formation separated 
from each other by 2.5 million km, LISA will be sensitive primarily to waves in the low 
frequency band between 10~* and 10~! Hz. Following the success of LISA Pathfinder, 
a mission designed to test and demonstrate some of the key technologies needed for the 
ambitious mission (Armano et al., 2016), LISA was formally selected in June 2017 by the 
European Space Agency for a launch around 2034, with NASA participating as a junior 
partner. 

In its most schematic realization, a laser interferometric gravitational-wave detector 
works just like the interferometer used by Michelson in the late 1800s to measure the 
speed of light and search for evidence of an “aether.” The real-life interferometers are 
much more complex than this, but this simple model is adequate and captures the essential 
physics. 

A laser interferometer consists of a laser source, a beam splitter, and two end mirrors 
mounted on test masses imagined to be freely moving in spacetime (although in reality they 
are suspended by thin wires). The arms of the interferometers are taken to be perpendicular 
to each other, although this is not essential (in LISA they make angles of 60°). The laser 
beam is divided in two at the beam splitter, and each beam travels along one arm of the 
interferometer, reflects off the test mass, and returns to the beam splitter to be recombined 
with the other beam. The relative phase of the beams determines whether they produce a 
bright or dark spot when the recombined beam is measured by a photon detector. Since 
the initial phases at the beam splitter are identical, the phase difference depends on the 
difference in travel time along the two arms. If a passing gravitational wave induces a 
displacement AL that changes the relative lengths of the two arms, it will induce a varying 
interference pattern. The ground-based advanced LIGO and Virgo are sensitive to strains 
AL/L at the level of 10~”” at 100 Hz. 

In LISA, the multimillion-kilometer long arms make reflection of the laser signal 
impractical. Instead each spacecraft is equipped with an optical system that records the 
phase of each incoming and outgoing laser signal referenced to the location of a freely 
floating test mass inside the spacecraft. By combining the phase information from these 
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six laser links (three in each direction) in a technique known as time-delay interferometry, 
one can measure the relative displacements of each pair of spacecraft. LISA’s peak strain 
sensitivity is expected to be of order 10~7! at about a millihertz frequency (Amaro-Seoane 
et al., 2012). 

The pulsar timing array (PTA) technique is another variant on the interferometry method. 
Here, an array of very stable millisecond pulsars distributed around the sky provides 
the analogue of the laser signal. The phases of the pulsed radio signals are measured 
as they arrive at Earth and monitored over multiyear timescales. Gravitational waves 
passing through the galaxy will induce phase variations in the pulsar signals that are 
sufficiently coherent that they can be distinguished from intrinsic pulsar noise, atomic clock 
noise on Earth and dispersion from the interstellar medium. This technique is sensitive to 
gravitational waves with frequencies of order the inverse of the multiyear observation time, 
or in the nanoHertz regime (for a review, see Manchester (2015)). 

Finally, gravitational waves of frequencies of the order of the inverse Hubble time can 
be probed by looking for a specific class of fluctuations (so-called “B-modes”) in the 
polarization of the cosmic background radiation. 

We now turn to a study of gravitational waves in metric theories of gravity. 


11.3 Speed of Gravitational Waves 


The Einstein Equivalence Principle demands that in every local, freely falling frame, the 
speed of light must be the same — unity, if one works in geometrized units. The speed 
of propagation of all zero-rest-mass, nongravitational fields must also be the same as that 
of light. However, EEP demands nothing about the speed of gravitational waves. That 
speed is determined by the detailed structure of the field equations of each metric theory of 
gravity. Some theories of gravity predict that weak, short-wavelength gravitational waves 
propagate with exactly the same speed as light. By weak, we mean that the dimensionless 
amplitude h,,,, that characterizes the waves is in some sense small compared to the metric 
of the background spacetime through which the wave propagates. that is, 


evel/lenel & 1s (11.1) 


and by short wavelength, we mean that the wavelength \ is small compared to the typical 
radius of curvature 7 of the background spacetime, that is, 


|A/R| <1. (11.2) 


This is equivalent to the geometrical optics limit, discussed in Chapter 3 for electromag- 
netic radiation. In the case of general relativity, for example, one can show (see MTW, 
exercise 35.15) that the gravitational wave vector @ is tangent to a null geodesic with 
respect to the “background” spacetime, that is, 


ig =0, FL =, (11.3) 
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where “|” denotes a covariant derivative with respect to the background metric. In a local, 
freely falling frame, where gf = Nyy, the speed of the radiation is thus the same as 
that of light. Gravitational radiation propagates along the “light cones” of electromagnetic 
radiation. 


Sl? 


General relativity 


A simple method to derive this result in general relativity, which can then be applied to 
other metric theories, is to solve the vacuum field equations, linearized about a background 
metric chosen locally to be the Minkowski metric. Physically, this is tantamount to solving 
the propagation equations for the radiation in a local Lorentz frame. As long as the 
wavelength is short compared to the radius of curvature of the background spacetime, this 
method will yield the same results as a full geometrical-optics computation. Referring to 
Section 5.2.2, where we defined the potential 1°? in Eq. (5.18), we see that the linearized 
vacuum field equation (5.19) in harmonic coordinates takes the form 


ne = 0, (11.4) 


where © denotes the flat-spacetime d’Alembertian. The plane-wave solutions of this 
equation are 


AXP — AP eitux" fen” =0, (11.5) 


where A®® and ¢,, are the constant amplitude tensor and wave vector, respectively. Thus, 
the electromagnetic and gravitational light cones coincide, that is, the gravitational waves 
are null. 


Scalar-tensor theories 


Here the linearized vacuum field equations in harmonic gauge, Eqs. (5.43) and (5.45), take 
the form 


ohne? =0, od=0. (11.6) 


where h? is the potential related to the auxiliary gothic inverse metric by poe = 
ne — g°8, Writing @ = do(1 + VW), where go is the asymptotic value of @ far from 
the system, we obtain the plane wave solutions 


pee = AP eltu™ 2b nt” =0, 
Y= Be Cent” =0, (11.7) 


where B is a constant amplitude. Recalling that 3, = (¢/¢0)g,,, we see that AF is 
related to the gothic metric potential A to linear order by 


Ae? = n°? — Un? , (11.8) 


Although the tensor and scalar waves do not necessarily have the same wave vector 
(depending on the source, ¢,, need not be the same as ¢',), both types of wave are null. 
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This will no longer be true if the scalar field has a potential V(@). If, for example, 
V(b) ~ m*¢”, then the scalar field will propagate like a massive field with a speed less 
than that of light, while the waves of the field /°? will remain null. 


Vector-tensor and TeVeS theories 


In vector-tensor theories the linearized field equations are much more complex than in the 
scalar-tensor case, with the propagation of linearized metric perturbations being strongly 
influenced by the background cosmological value of the vector field K. In general there 
are ten different wave modes (six for the metric, four for the vector field), each with its 
own characteristic speed and polarization, although various gauge symmetries within a 
specific theory may reduce this number. For example, in the Einstein-A2ther theory (see 
Section 5.4.1), there are five modes: two pure metric modes (no vector waves), with speed 
(1 — éa.j- 2, where c, = c; + c3 and three mixed metric-vector modes, with speeds 
that are complicated functions of the parameters c;...c4 of the theory (Jacobson and 
Mattingly, 2004). 

TeVeS theories have the same general vector-tensor structure as the general vector-tensor 
theories (see Section 5.5), and as a result, they predict the same modes with the same 
speeds. At linearized order, the scalar modes decouple from the tensor and vector modes, 
but because of the nature of the interpolating function F(k?h“” ¢ ,,¢,,) that appears in the 
TeVeS action (5.69), the speed of the scalar waves is generally much lower than the speed 
of light (Sagi, 2010). 


Massive gravity 


Many massive gravity theories (Section 5.7) are motivated by a desire to have the metric, 
linearized about a flat spacetime background, obey a massive vacuum wave equation, of 
the form 


nee — (Ze) aa =0; (11.9) 


where mg is the mass associated with the field. Substituting the ansatz h°? = 
A exp(it,.x"), we obtain nv l“l’ = —(m,/h)*. With = w and ¢/ = k/, we obtain the 
dispersion relation w* — |k|? = (m,/h)?. With E = hw and p = hi, this is equivalent to 
E’ — p’ = m;, Then it is straightforward to show that the propagation speed of the wave, 


Vg = dw /dk is given by 
oe 1/2 ¥ 1/2 
= & = 
“= ap) Se) (11.10) 


where Ag = h/mg is the “graviton” Compton wavelength, and A is the wavelength of 
the gravitational waves. This wavelength-dependent propagation speed is analogous to the 
dispersion experience by an electromagnetic wave propagating through a tenuous plasma, 
a phenomenon well-known to radio astronomers. In Section 12.2.3, we will see how the 
gravitational-wave signals detected by LIGO in 2015 and 2017 were used to place a bound 
on the Compton wavelength Ag. 
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11.4 Polarization of Gravitational Waves 
NSS aE 


11.4.1 The F(2) framework 


General relativity predicts that weak gravitational radiation has two independent states of 
polarization, conventionally called the “+” and “x” modes, or the +2 and —2 helicity 
states, to use the language of quantum field theory. However, general relativity is probably 
unique in that prediction; every other known, viable metric theory of gravity predicts more 
than two polarizations for the generic gravitational wave. In fact, the most general weak 
gravitational wave that a theory may predict is composed of six modes of polarization, 
expressible in terms of the six “electric” components of the Riemann tensor Rojoz, that 
govern the driving forces in a detector (Eardley et al., 1973a,b). 

Consider an observer in a local freely falling frame. In the neighborhood of a chosen 
fiducial event P along a world line y, construct a locally Lorentz orthonormal coordinate 
system (7, €/) with 7 as proper time along the world line and P as spatial origin (“Fermi 
normal coordinates’). The metric has the form (PW, section 5.2.5) 


S00 = —1 — Ropog(y)eE4 + O(E?), 
2 
8y = 3 Ripao (1) 6°5" +O(&), 


1 
Bit = 5x — 5Rpig (VEE! + OE). (11.11) 
For a test particle with spatial coordinates €/, momentarily at rest in the frame, the 
acceleration relative to the origin is 


CE 1 
ae = 78007 = Rojog (YE! - (11.12) 


Note that despite the possible presence of auxiliary gravitational fields in a given metric 
theory of gravity, the acceleration is sensitive only to the Riemann tensor. This is not 
necessarily true if the body has self-gravitational energy (Lee, 1974). 

Thus, a gravitational wave may be completely described in terms of the Riemann tensor 
it produces. We characterize a weak, plane, nearly null gravitational wave in any metric 
theory (Eardley et al., 1973b), as observed in some local Lorentz frame, by a linearized 
Riemann tensor with components that depend only on a retarded time uw, that is, 


Rapys = Ropys(t). (11.13) 
The “wave vector” ¢ » defined by 


fy Sts (11.14) 


is normal to surfaces of constant w and is almost null with respect to the local Lorentz 
metric, that is, 


vb, =e, lel <1, (11.15) 
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where €« is related to the difference in speed, as measured in a local Lorentz frame at rest in 
the universal rest frame, between light and the propagating gravitational wave, that is, 


2 
<= (<) = 7 (11.16) 
Vg 


We now wish to analyze the general properties of the Riemann tensor for a weak, plane, 
nearly null gravitational wave. To do this, it is useful to introduce a locally null basis 
instead of the local Lorentz frame’s Fermi normal basis (7, €/). Consider a null plane wave 
(light, for instance) propagating in the +Z direction in the local Lorentz frame. The wave 
is described by functions of retarded time u, where u = t — Z. (we use units in which the 
locally measured speed of light is unity). A similar wave propagating in the —Z direction 
would be described by functions of advanced time v, where v = t + Z. We now define the 
four-vector fields = ¢“e,, andn = n“e,,, where 


1 
Ly = Uy, My = —5¥n (11.17) 


These vectors are tangent to the propagation directions of the two null plane waves. In the 
(r, €/) basis they have the form 


1 
= (1,0,0,1), = 5(1,0, 0, —1), (11.18) 


and are null with respect to 7, that is, 2"0"’n,, = n'"n” nv = 0; note that 2“n’ ny, = —1. 
We also introduce the complex null vectors m = m“e,, and m = m"e,,, where the bar 
denotes complex conjugation, and where 


1 
m* = —(0,1,1,0), m* = —(0, 1, -i, 0). (11.19) 
a ) ( ) 
These vectors are also null, since mm”n,,,=0, and m'm’n,,—=0; note that 
m"in’ Nv = 1. These orthogonality relations imply that 


nh? = —20n”) + Im“ in”) | (11.20) 


For the remainder of this section, we will use Roman subscripts (excluding the usual spatial 
indices i, 7, k) to denote components of tensors with respect to the null tetrad basis @, , m, 
m, that is, Papp... = Pippo" ..., where a, b,c, ... run over , n, m, and m, while p, 
q,1,... Tun over only £, m, and m. 

Because the null tetrad £, n, m, and m is a complete set of basis vectors, we may expand 
the gravitational wave vector £ terms of them. But since the gravitational wave is not 
exactly null, this expansion will depend in general upon the velocity of the observer’s local 
frame relative to the universe rest frame. Choose a “preferred” observer, whose frame is at 
rest in the universe, and let ( in this frame have the form 


Ch = bP (1 + €¢) + eqn” + nm" + Enin" , (11.21) 


where {€2, €n, €n} ~ O(€). However, this observer is free (i) to orient his spatial basis so 
that the gravitational wave and his null wave are parallel, so that 0’ « ¢/, and (ii) to choose 


243 


11.4 Polarization of Gravitational Waves 


the frequency of his positively propagating null wave to be equal to that of the gravitational 
wave, so that £° = £°. This implies that €,, = 0, €¢ = —e,/2, and 


~ 1 
gh = 0 — 6, (50 nt). (11.22) 
Now, because the Riemann tensor is a function of retarded time i alone, we have that 


~ ORawys ah. 44 
Ronse e—l a = 7 Reais: (11.23) 


This, together with the orthogonality relations among the null tetrad vectors, implies that 


Ropys,e = €nRap-s ’ 


Ropyéin = (: - 5) Roupyss 
Rapysm = 9. (11.24) 
The linearized Bianchi identities Rgpjcd,e) = 0 then yield 
Riiian = OER) (11.25) 
which, except for a trivial nonwavelike constant, implies that 
Ratpg = Rogar = O(n) « (11.26) 


Thus the only components of the Riemann tensor that are not O(e,,) are of the form Rapnq: 
There are only six such components, and all other components of the Riemann tensor can 
be expressed in terms of them. They can be related to particular tetrad components of the 
irreducible parts of the Riemann tensor known as Newman-Penrose quantities (Newman 
and Penrose, 1962), specifically the Weyl tensor V4 and the tracefree Ricci tensor ® 4, (see 
MTW section 13.5 for definitions). For our nearly null plane wave in the preferred tetrad, 
they have the form 


1 
Wo = —=Rnene + O(enR) ey 


6 
1 
W3 = —ZRnenm +P O(enR) > 
Wy = —Rimm > 
Pr. = —Ramnm ’ (11.27) 


(3 and V4 are complex). These results are valid for a gravitational wave as detected by 
the preferred observer. Now, in order to discuss the polarization properties of the waves, 
we must consider the behavior of these components as observed in local Lorentz frames 
related to the preferred frame by boosts and rotations. However, we must restrict attention 
to observers who agree with the preferred observer on the frequency of the gravitational 
wave and on its direction; such “standard” observers can most readily analyze the intrinsic 
polarization properties of the waves. The Lorentz frames of these standard observers are 
related by a subgroup of the group of Lorentz transformations that leave é unchanged. 
The most general such transformation is given by 
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( )L—€,(am+am) + O(e), 
n’ = ( ) (n+ aal + Gm + am) + O(e), 
m' = (1 — ade,) e'® (m + ak) — €,ae'® (n + aim) + O(€), 
in’ = (1 — ade,) e'? (m + GL) — eae"? (n + am) + O(e), (11.28) 


where a is a complex number that produces null rotations (combinations of boosts and 
rotations) and ¢ is an arbitrary real phase (0 < @ < 27) that produces a rotation about 
ez. The parameter a is arbitrary except for the restriction aa < ¢,!. This expresses the 
fact that our results are valid as long as the velocity w of the frame is not too close to the 
speed either of light or of the gravitational wave, whichever is less; note that for nearly 
null waves ¢,! > 1, and almost any velocity that is not infinitesimally close to unity is 
permitted, since a@ ~ w*/(1—w7). For exactly null waves, ¢, = 0 and arbitrary velocities 
w < | are permitted. 

Under this set of transformations, the amplitudes of the gravitational wave change 
according to 


U5 = U2. + O(e,R), 

WU, =e"? (W3 + 3aW2) + O(€,R), 

Ui, =e? (Wy + 4a; + 6072) + O(e,R), 

©, = Oy + 2aW; + 2aWV; + 6aaW, + O(e,R). (11.29) 


Consider a set of observers related to each other by pure rotations about the direction 
of propagation of the wave (a = 0). A quantity that transforms under rotations by a 
multiplicative factor e'’? is said to have helicity s as seen by these observers. Thus, ignoring 
the correction terms of O(€,R), we see that the amplitudes U2, Y3, U4 and 2) have 
helicities 


Wo s=0.; Do s= 0, 
U3: s=-l, W: s=4, (11.30) 
Wy, s=-2, Wy = +42 


However, these amplitudes are not observer-independent quantities, as can be seen from 
Eq. (11.29). For example, if in one frame V4 4 0 but V2 ¥ 0, then there exists a frame 
in which &, = 0. Thus, the presence or absence of the components of various helicities 
depends upon the frame. Nevertheless, certain frame-invariant statements can be made 
about the amplitudes, within the small corrections of O(¢,R). These statements comprise 
a set of quasi-Lorentz invariant classes of gravitational waves. Each class is labeled by the 
Petrov type of its nonvanishing Weyl tensor and the maximum number of nonvanishing 
amplitudes as seen by any observer. These labels are independent of observer. For exactly 
null waves (€, = 0), the classes are: 


* Class Ig: V2 ~ 0. All standard observers measure the same value for V2, but disagree 
on the presence or absence of all other modes. 

* Class I//5: Vz = 0, V3 ~ 0. All standard observers agree on the absence of V2 and on 
the presence of V3, but disagree on the presence or absence of V4 and ®9. 


245 


11.4 Polarization of Gravitational Waves 


* Class N3: Vy = 0, V3; = 0, By 4 0, Vy A O. Presence or absence of all modes is 
observer-independent. 

¢ Class Nz: Vy = 0, V3 = 0, B22 = 0, V4 F 0. Independent of observer. 

* Class O;: UV, = 0, V3 = 0, x7 # 0, Vy = 0. Independent of observer. 

* Class Oo: V2 = V3 = B27 = Wy = 0. Independent of observer: No wave. 


For nearly null waves, simply replace the vanishing of modes (= 0) with the nearly 
vanishing of modes [~ O(e,R)]. This scheme, developed by Eardley et al. (1973b), is 
known as the £(2) classification for gravitational waves, since in the case of exactly null 
plane waves (€, = 0), the transformation equations (11.28), are the “little group” E(2) of 
transformations, a subgroup of the Lorentz group. The £(2) class of a particular metric 
theory is defined to be the class of its most general wave. 

Although we have confined our attention to plane gravitational waves, one can show 
straightforwardly (Eardley et al., 1973b) that these results also apply to spherical waves 
far from an isolated source provided one considers the dominant 1/R part of the outgoing 
waves, where R is the distance from the source. 


11.4.2 E(2) class of specific metric theories 


To determine the £(2) class of a particular theory, it is sufficient to examine the linearized 
vacuum field equations of the theory in the limit of plane waves (observer far from source 
of waves). Here we present some examples. Some useful identities that can be obtained 
from Eqs. (11.20) and (11.26) are 


Rae = Rnene + O(EnR) 
Ran = 2Ramnm + O(€nR) , 
Ram = Ratnam + O(enR) ; 
R= —2Ry + O(enR) . (11.31) 


If the Riemann tensor is computed from a linearized metric perturbation p,,,,(a), then 
1 
Ruvap = 2 (Pyp.ov — Puopv + Pvo.By — Prp.ou) 
l/s. Ee oe rie 
= 5 (bab Bus — bob Bua + Ubybva — Labubvs) (11.32) 
and 


1. 
WV, = —put O(enR), 


12 
1. 
W3 = qPen i O(enR) > 
1 
W4= Pim aia O(enR) ’ 
1 
Px = =pmm + O(E,R). (11.33) 


2 
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General relativity 


The vacuum field equations are R,,,, = 0. The waves are null (e, = 0). Eq. (11.31) then 
implies that 


Rnene Ramnm Rnenm 0 > ( 11.3 4) 


and thus that 
W, = VU; = 0, =0. (11.35) 


The only unconstrained mode is V4 4 0, so general relativity is of E(2) class Nz. There 
are only two polarization modes, of helicity +2. 


Scalar-tensor theories 


In the massless scalar case, the linearized vacuum field equations (5.35a) and (5.35b) take 
the form 


no =0, 
Ro Ra Os Ga) 
po — ZN = 5 (byw ~ Muvnd) » 
R=0(¢,”), (11.36) 


where ¢o is the cosmological boundary value of the scalar field ¢. The solution to the first 
of Eqs. (11.36) for a plane wave is 


ob = doBe™™ , (11.37) 
where 7),,,£"0" = 0. Then from the remaining two of Eq. (11.36), 
Ruy = —Be’" £,,8,. (11.38) 
Thus Rig = Ram = 0, but Ran A 0, thus 
%=%,=0, 6.40,0,40, (11.39) 


and scalar-tensor theories are of class N3. 


Einstein-Ather theory 


Depending on how the source excites waves in the vector field, there are three different 
modes of gravitational wave in Einstein-A&ther theory (Jacobson and Mattingly, 2004). 
In one mode, the vector field is unexcited, leaving a pure gravity mode of class Nz (V2 = 
W3 = 2) = 0) exactly as in general relativity. A second mode has tranverse excitations 
of the vector field, producing gravitational waves of class I//5 (V2 = 0). The third mode 
has longitudinal excitations of the vector field, producing waves of the most general class, 
IIs. Each mode has a different gravitational wave speed. The modes and their speeds are 
summarized in Table 11.1. 
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Table 11.1 Gravitational-wave polarizations and speeds in Einstein-Ather theory. 


Mode vg and small ¢; limit E(2) class 
Pure metric faa > 1 Np 
Ayo 
Transverse vector agar ty Gl IITs 
2e14(1—¢13) c14 
‘ 8 123 2=cy4 123 
Longitudinal vector ey =a Ge 6 


11.4.3 Measurement of gravitational-wave polarizations 


Consider an idealized experiment in which an observer uses an array of devices to 
determine via Eq. (11.12) the six components Rojo, of the Riemann tensor for an incident 
wave. These are generally referred to as the “electric” components, by analogy with the 
electric field components FY of the electromagnetic field tensor. Let us suppose that the 
waves come from a single localized source with spatial wave vector k chosen to be in 
the Z direction. Basis vectors ey and ey are chosen to be perpendicular to the Z direction, 
and therefore to lie in the plane of the sky, with the orientation chosen to reflect some 
characteristic of the source (for example, the line of the ascending node of an orbit inclined 
with respect to the plane of the sky). If the observer expresses her data on the displacements 
€* as a3 x 3 symmetric matrix, defined by 


1 
Rojox = — 5p qa diel) (11.40) 


where R is the distance to the source, then, to first order in the small displacement £*, 
Eq. (11.12) gives 


1 
E(t) = (0) + sp e(E(0), (11.41) 
where A;, can be written in the form 


Ay As Ax Avi 
Aj = Ay —-A,+As Avo : (11.42) 
Ay Ay AL 


The standard XYZ orientation of the matrix elements is assumed. From Eqs. (11.18), 
(11.19), and (11.27), the amplitudes in Aj, are related to the Newman-Penrose quantities by 


di ies ie 
Rey = Ay, ImWy = - 4x, 
1 1 
Rel, = Bes, Tita = (a 
+= Fag 3 ey ae 
= I . 
®5, = —A W, = ——~A_. 11.4 
2 = aps, 2 = Tppdt (11.43) 


For each polarization mode, Eq. (11.41) produces a specific distortion of, say, a sphere of 
freely moving particles, as displayed in Figure 11.1. 
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° 
one? 


(e) Re Ys (f) Im V3 


Displacements induced on a sphere of free particles by each polarization mode of a plane gravitational wave 
permitted in any metric theory of gravity. The wave propagates in the +Z direction and has a sinusoidal time 
dependence. The solid and dashed lines are snapshots at wt = 0 and 7, respectively. There are no displacements 
perpendicular to the plane of the figure. In (a), (b) and (c), the wave propagates out of the page; in (d), (e), and (f), 
the wave propagates in the direction of the arrow. 


Now, if the observer knows the direction A a priori, either by associating the wave 
with an electromagnetic counterpart, such as a gamma-ray burst, or by triangulating the 
direction using the arrival times of signals detected at widely spaced antennas, then by 
choosing the Z-axis parallel to k, she can determine uniquely the amplitudes as given in 
Eq. (11.42), and thereby obtain the class of the incident wave. Because a specific source 
need not emit the most general wave possible, the £(2) class determined by this method 
would be the least general class permitted by any metric theory of gravity. 
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/ 
€5 
ex 
ej 
ey 
al 
Relation between the detector basis (e,, e2, e3) and the source basis (ey, ey, €7 = —é}). 


However, if the observer does not know the direction a priori, it is not possible to 
determine the £(2) class uniquely, since there are eight unknowns (six amplitudes and 
two direction angles) and only six observables Aj. In particular, any observed Aj, can 
be fit by an appropriate wave of class //, and an appropriate direction. However, for 
certain observed Aj, the E(2) class may be limited in such a way as to provide a test 
of gravitational theory. For example, if the driving forces remain in a fixed plane and are 
pure quadrupole, that is, if there is a fixed coordinate system in which 


A pb O 
An=| wp -rA 0], (11.44) 
0 0 O 


then the wave may be either //5 (unknown direction), or V2, (direction parallel to the z-axis 
of new coordinate system). If this condition is not fulfilled, however, the class cannot 
be N2. Such a result would be evidence against general relativity. Eardley et al. (1973b) 
provide a detailed enumeration of other possible outcomes of such idealized polarization 
measurements. 

We must now link this idealized description of the polarization modes to measurements 
made by realistic gravitational wave detectors, for which the natural basis is defined by the 
detector, not the source. 

For an Earth-bound laser interferometer, the natural basis is one where the directions of 
the two arms e, and e> define the x and y axes, and e3, perpendicular to the Earth’s surface, 
defines the z axis (see Figure 11.2). The source direction is then characterized by the polar 
angles 6 and ¢, and the orientation of the source basis relative to the rotated detector basis 
(the primed basis in Figure 11.2) is characterized by a phase 7 (note that k = ez = —e5). 

In addition, the detector response is somewhat more complicated than our ideal 
measurement, because it involves finding the difference in displacements of two free 
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bodies (suspended mirrors) at the ends of the perpendicular arms of the interferometer. 
It is straightforward to show (see, e.g., PW, section 11.5) that the phase difference between 
the two laser beams when they are recombined after travelling along each arm is given by 


= A4tvLo 
OR 


where v is the laser frequency, Lo is the fiducial length of the arms, and the interferometer’s 
“response function” S(t) is given by 


A® 


S(t), (11.45) 


S(t) = 5 (clet — ele) a*(). (11.46) 


Taking into account the rotations between the detector basis and the source basis that 
defines 4", we obtain 


S(t) = F,Al +FyAl. + FsAg + Fy Ay, + Fv24y2 + FLA (11.47) 
where the “angular pattern functions” F', are given by 
1 
Fr= 5 + cos’) cos2¢, 
Fy, = —cos@sin2¢, 
1 
Fy = 5 sin?0 cos 2¢, 
Fy, = —sin@cos6@cos2¢, 
Fy. = sin@sin2¢, 
1 
FL = 5 sin’ cos 2¢, (11.48) 
and where 
A! = A; cos2y + Ax sin2y, 
Al, =A, sin2y — Ay cos2y, 
Ay, = Avi cosy + Ay2 siny, 
Ay = Avi sin y) — Ay2 cos ~. (11.49) 


Notice that the 4’, amplitudes are simply the original amplitudes in the source basis 
projected onto the basis vectors e/ and e}, with the factor 27) in A’, and 4’, arising because 
those modes have helicity 2. Also notice that, since F's = —F., measurements using laser 
interferometers cannot measure the scalar and longitudinal modes separately, but can only 
measure the difference As — A,. If the direction to the source is known, then a suitable array 
of five interferometers with distinct orientations, leading to five distinct values of 6, ¢, can 
in principle measure the five amplitudes A’, , A’., A\,,, Ay, and As — Ay. One can also show 
that, for an interferometer whose arms make an angle y relative to each other, the angular 
pattern functions of Eqs. (11.48) are simply multiplied by a factor sin y (Cutler, 1998). 
One might be tempted to imagine that, when the full network of kilometer-scale 
interferometers consisting of the two LIGO devices and Virgo, plus KAGRA and LIGO- 
India are online, such tests of polarization might become routine. However, the arms of the 
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two LIGO interferometers in Hanford and Livingston are actually parallel to each other, 
apart from a small relative angle of about 27° induced by the curvature of the Earth’s 
surface and a relative rotation of 90°. As a result, e|! = ef, ef) = —ef, and S"(t) + —S*(t). 
This alignment is the result of a decision made early in the development of LIGO to forgo 
any sensitivity to multiple polarizations in favor of the redundancy provided by almost 
identical responses to any given signal. This decision was validated in the initial detection 
of GW150914, where one can see by eye in the almost raw data (mildly band-pass filtered 
to suppress low-frequency and high-frequency noise) that the chirp signals recorded in the 
Hanford and Livingston detectors are the precise negatives of each other. 

Thus to carry out polarization tests, it will be necessary either to add an interferometer 
to the network, to combine data from sources of similar type (such as binary black 
hole mergers), or to focus on restricted classes of theories, such as N3, N4 or O,. For 
specific analyses, see Wagoner and Kalligas (1997), Brunetti et al. (1999), Gasperini 
(1999), Magueijo (2003), Wen and Schutz (2005), and Nishizawa et al. (2009). By 
combining signals from various interferometers into a kind of “null channel” one can test 
for the existence of modes beyond the + and x modes in a model-independent manner 
(Chatziioannou et al., 2012). 

The space-based interferometer LISA can also measure polarizations. In this case, 
because the arm lengths (~ 2 x 10° km) may be comparable to or longer than the 
gravitational wavelength, our analysis based on local Fermi normal coordinates is no 
longer appropriate. Instead one uses an approach called “time delay interferometry” (TDI) 
(Armstrong et al., 1999), in which one determines the effect of the gravitational wave on 
each of the six laser beams propagating between the three spacecraft at the moment of 
emission and the moment of reception. The result is six “Doppler response functions,” two 
for each spacecraft. Certain linear combinations of these functions completely suppress the 
contributions of laser noise, for example. Because the orientation of the spacecraft triangle 
changes as the constellation orbits the Sun, the response to the incoming gravitational wave 
varies continuously, permitting both localization of the source and measurement of the 
polarization modes (Tinto and Alves, 2010; Nishizawa et al., 2010). Gravitational-wave 
polarization can also be studied using pulsar timing arrays (Lee et al., 2008; Alves and 
Tinto, 2011; Chamberlin and Siemens, 2012). 


11.5 Generation of Gravitational Waves 
—— Se 


It is common knowledge that general relativity predicts that the lowest multipole emitted 
in gravitational radiation is quadrupole, in the sense that, if a multipole analysis of the 
gravitational field in the radiation zone far from an isolated system is performed in terms 
of tensor spherical harmonics, then only the harmonics with @ > 2 are present (see Thorne 
(1980) for a thorough discussion of multipole-moment formalisms). For material sources, 
this statement can be reworded in terms of appropriately defined multipole moments of 
the matter and gravitational-field distribution within the near-zone surrounding the source: 
the lowest source multipole that generates radiation is quadrupole. For slow-motion, 


252 


Gravitational Radiation 


weak-field sources, such as binary star systems, quadrupole radiation is in fact the dominant 
multipole. The result is a gravitational waveform in the radiation zone given, to leading 
post-Newtonian order, by 
Des 

h(t) = go es (11.50) 
where h/* is defined by Eq. (5.18), R is the distance from the source, T = ¢ — R is retarded 
time, dots denote time derivatives, and J/* is the quadrupole moment tensor of the source, 
given by 


= ‘| px Bx , (11.51) 


where p* is the conserved density. The waveform h/* can be related to the measured electric 
components of the Riemann tensor by Eq. (11.33), 


Ririnm = —V4 = — Fea (11.52) 
The rate of change of energy of the system resulting from the flux of energy carried by the 
waves to infinity is given by 
dE 1 -(jk) (ik) 
7 rs (11.53) 
where [") = p/k — I §J7 js the tracefree quadrupole moment tensor. 

Eqs. (11.50) and (11.53) are collectively referred to as the “quadrupole formulae” for 
gravitational radiation. It has also become the accepted convention to refer to Eqs. (11.50) 
and (11.53) as the “Newtonian” waveform and energy flux. Even though Newtonian 
theory does not admit gravitational radiation because the interactions are considered to 
be instantaneous, the term “Newtonian” establishes these expressions as the lowest order 
terms in a post-Newtonian sequence of contributions, according to general relativity. Thus 
one can talk about 0.5PN, 1PN, 2PN corrections to these expressions. 

For a binary system with total mass m = mj, + mz, reduced mass parameter 7 = 
mm /m*, orbital separation r and relative velocity v, we obtain the waveform 


nit = 4 (v4 = “wnt (11.54) 
and the energy loss rate 
dE 8 2 m 4 P) By) 
—=-—7 (—) (12v -1lr), 11.55 
di 15! (5) ee vee) 


where 7 = dr/dt (Peters and Mathews, 1963). This loss of energy results in a decrease of 
the orbital period P at a rate given, to lowest PN order, by P/P i 3E /E. 

These comments apply to the asymptotic properties of the outgoing radiation field. 
However, if we are interested in the response of the source itself to the emission of the 
radiation we must deal with the field within the near zone at a level of approximation that 
incorporates the effects of gravitational radiation back reaction. A variety of computations 
in general relativity have yielded a local radiation-reaction force that, in a particular gauge, 


takes the form 


F 20). 
Fi = — 5 x, (11.56) 


React ~— 
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where the superscript (5) denotes five time derivatives. In the equations of motion, this 
represents a correction to Newtonian gravity at 2.5PN order. This leads to a loss of 
mechanical or orbital energy by the system that agrees with Eq. (11.53). 

Quadrupole radiation also leads to a decrease of the orbital angular momentum and to a 
corresponding decrease of the orbital eccentricity. 

Unlike general relativity, however, nearly every alternative metric theory of gravity 
predicts the presence of lower multipoles — monopole and dipole — in gravitational 
radiation, in addition to the quadrupole and higher multipole contributions (Eardley, 1975; 
Will and Eardley, 1977; Will, 1977). At zeroth PN order, the monopole moment is just 
the total rest mass of the system, which is conserved. Thus two time derivatives of the 
leading contribution to the monopole moment will vanish. At the first PN order, there are 
contributions to the monopole moment of order my”, which is of the same order as two 
time derivatives of the quadrupole moment, and so these contributions to the monopole 
moment serve mainly to modify the radiation at quadrupole order. At zeroth PN order, the 
dipole moment is proportional to the center of rest mass of the system, which at lowest 
order moves at constant velocity; thus two time derivatives of that quantity will vanish, by 
virtue of the Newtonian equations of motion. However, post-Newtonian corrections to the 
dipole moment need not share this property, and thus there could be dipole contributions 
to the radiation. 

By contrast, in general relativity, the monopole moment turns out to be the total mass 
of the system, which is constant by virtue of the full equations of motion, at least up 
to the 2.5PN order at which the loss of mass to the flux of gravitational waves must be 
incorporated. Similarly, the dipole moment turns out to be the fu// dipole moment, or center 
of mass of the system, which is a linear function of time by virtue of the full equations of 
motion, at least up to the order at which the system experiences a recoil due to the emission 
of linear momentum in gravitational waves. These properties, which are a result of the 
remarkable adherence of general relativity to the Strong Equivalence Principle, account 
for the fact that quadrupole radiation leads the multipolar contributions to gravitational 
waves in general relativity. 

In many alternative theories, the most important contribution to dipole radiation comes 
from a 1PN-order, residual dipole moment of gravitational energy, schematically of the form 
Di ~ f p*Ux/d?x. Focusing on the part of U that is the internal gravitational potential 
of each body, we can express the dipole moment in the form D/ ~ >~, Q,x), where 


Qq = —5 J, p* Ud?x. For a binary system with its center of mass placed at the origin 
of coordinates, we have to lowest PN order, x; = (mz/m)x and x. = —(mj,/m)x, and we 
have that 
D/ ~ nm8x’, (11.57) 
where 
Q Q 
c= (11.58) 
my, m2 


is the difference in self-gravitational energy per unit mass between the two bodies. For 
compact bodies, S typically is related to the difference s; — s2 between the sensitivities of 
the two bodies, as defined in Section 10.3. This varying dipole moment can thus contribute 
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both to the gravitational waveform and to the fluxes of energy and angular momentum. 
Furthermore, if the bodies are compact bodies, such as neutron stars, where 2/m could 
be as large as 0.3 or even black holes, where the analogue of 2./m could be as large as 
unity, the contributions of dipole radiation could be larger than expected. In some theories, 
the flux of energy due to dipole radiation is similar to that in electromagnetism, namely 
dE/dt~ D-D ~ 9? (m/r)*S?. 

Generalizing the quadrupole energy flux of general relativity, Eq. (11.55) and adding the 
dipole energy flux, we can write down a parametrized formula for the leading energy loss 
rate from binary systems in a class of alternative theories, given by 


4 
o =-1g7! (=) E (Kv? — K2i*) 4 srs?) (11.59) 


where G = Qj is defined by Eq. (10.29), &,, &2 and Kp are parameters that vary from 
theory to theory. In general relativity, G = 1, h) = 12, K2 = 11 andkp = 0. 

The foregoing remarks are rather broad and qualitative. In the end, one must treat 
each theory in turn and calculate its gravitational waveform and fluxes of energy, angular 
momentum and linear momentum to the PN order needed. The next section attempts to 
provide a “recipe,” largely based on experience with general relativity and scalar-tensor 
theories, for carrying out such calculations. 


11.5.1 Calculating gravitational waves in alternative theories 


In this section we will outline a general approach for calculating gravitational waves using 
a formalism developed for general relativity by Epstein and Wagoner (1975) and Wagoner 
and Will (1976), and extended by Will and Wiseman (1996) and Pati and Will (2000, 
2002). This approach is closely related to the “post-Minkowskian” framework of Blanchet, 
Damour, and Iyer (for a thorough review, see Blanchet (2014)). Another approach, inspired 
by quantum field theory, is “Effective Field Theory” framework pioneered by Goldberger 
and Rothstein (2006) (for a review, see Foffa and Sturani (2014)). 

We will focus our attention on gravitational waves from N-body systems (with N=2 
being the most important case), assuming that the bodies are small compared to their 
separations. The two-body case is relevant for binary pulsar observations and for the first 
detections of gravitational radiation from compact binary inspiral. 

Other potential sources of gravitational radiation, such as core collapse in supernovae, 
spinning deformed neutron stars, collisions of cosmic strings, and cosmological gravita- 
tional waves, involve very different assumptions, considerable input from nongravitational 
physics, and substantial numerical computation; these are beyond the scope of this book. 

Because of the complexity of many alternative theories of gravitation beyond the post- 
Newtonian approximation, it has proven impossible to devise a formalism analogous to 
the PPN framework for describing gravitational radiation from dynamical sources. But 
we can provide a general description of the method used to arrive at the gravitational 
waveform and the energy and angular momentum fluxes within a chosen theory of gravity, 
emphasizing those features that are common to many currently viable theories, such as 
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those described in Chapter 5. Later, we will treat specific theories. The method proceeds as 
follows: 

Step 1: Select a theory, identify the variables and set the cosmological boundary condi- 
tions (follow Steps 1-3 in Section 5.1). Restrict the adjustable constants and cosmological 
matching parameters to values that give close agreement with solar-system tests. 

Step 2: Working in the universal rest frame, expand the metric and any scalar, vector or 
tensor fields about their asymptotic values: 


0 
Suv = ow + Pu ’ 


g=O(1+W), 
K*=(K+#, kB, PR), 
Buy = BO) + Buv- (11.60) 


We assume that the vector field is timelike, so that, in the universal rest frame far from 
the source, it has only a time component. All current theories with vector fields make this 
assumption. In theories with fields that single out a preferred rest frame, it may actually 
be advantageous to transform to the rest frame of the system under study at the beginning 
of the calculation, rather than at the end, as we did for the post-Newtonian limits of such 
theories. 

Step 3: Derive the “relaxed field equations.” Using any gauge freedom available, express 
the field equations in the form 


y4 = -16nr4, (11.61) 


where 74 is one of the fields Py» Y, and so on, or possibly a linear combination of fields, 
and the “source” 7“ typically consists of matter and nongravitational field stress-energies, 
and of “gravitational” stress-energies consisting of terms quadratic and higher in the fields. 
The operator 0 is given by 


ee ae (11.62) 
VoA 


where the wave speed vz, is a function of adjustible constants and matching parameters of 
the theory, and could be different for different 1“. 

Because we will be dealing only with Lagrangian-based metric theories, we know 
that each theory admits a pseudotensorial object 7“” that reduces to the matter energy- 
momentum tensor 7” in the absence of gravity, and that satisfies the equation 


gh ape, (11.63) 


(see Lee et al. (1974) and the discussion of Section 4.4). The system of Eqs. (11.61) 
and (11.63) is sometimes referred to as “relaxed” field equations, because one can solve 
Eq. (11.61) formally using the Green function for the operator O as a functional of the 
matter variables, without specifying the behavior of the matter (this is the “relaxed” aspect 
of the procedure). Then one can use Eq. (11.63) to obtain the matter behavior as a function 
of time and thereby obtain the full solution for the fields of the theory (see PW section 6.2 
for further discussion). 
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Iterating the relaxed field equations 


If one assumes that the fields are weak, so that 7" < 1, then itis natural to approximate solutions to Eq. 
(11.61) by a process of iteration. First substitute a" — 0 in the right-hand-side of (11.61), so that 7“ consists 
of only matter contributions, and obtain the first iteration Vy . Then substitute those solutions back into the 
right-hand side of (11.61), and obtain the second iterated fields Vy: Continue the process until the desired 
accuracy is reached. 

There is a persistent legend that, in order to obtain the gravitational wave signal and energy flux in 
general relativity at the leading quadrupole order, one only needs to obtain the first iterated field, which 
corresponds to linearized theory. While this is correct for the radiation from a source governed internally by 
nongravitational forces, such as an object in the laboratory, it is incorrect for a self-gravitating source, such 
as a binary star system. This is because the equations of motion that follow from the first iteration are given 
by Ty, , = 9, which do not contain gravitational interactions. A second iteration of the fields is required 
in order to arrive at a solution for gravitationally interacting systems that is compatible with the equations of 
motion Ty, = 0, which now contain Newtonian gravity at leading order. The fact that the final answer, the 
so-called “quadrupole formula” (11.50), is formally the same for binary systems as for laboratory sources is both 
coincidental and misleading. The difference is in the meaning of the two time derivatives: for binary systems, 
one inserts the Newtonian equation of motion for the acceleration generated by two time derivatives, while 
for a laboratory system one inserts equations of motion appropriate for a system with only nongravitational 
interactions, such as a dumbell governed by internal atomic forces. The underlying reason for this formal 
coincidence is the Strong Equivalence Principle, which asserts that certain behaviors of systems, such as their 
motion or the waves they generate, are not sensitive to the internal structure of the bodies or systems, so 
that the leading-order waves generated by either a dumbell or a binary system depend only on the mass 
quadrupole moment. 

This coincidence cannot be expected to hold in alternative theories of gravity, which generically violate SEP. 
Thus one cannot rely upon the linearized approximation to yield sensible results, but instead must carefully 
consider the structure of the relaxed field equations (11.61) and (11.63) in the theory and determine the number 
of iterations required to obtain solutions for the field and the equations of motion that are compatible with 
each other. For a detailed discussion within general relativity of this iteration process and of how to determine 
the number of iterations needed for a given situation, see PW, section 6.2.3. 


Step 4: Obtain the formal solutions in the wave zone. The solution of Eq. (11.61) that 
has outward propagating disturbances at infinity is 


A t— eee : / 
u(t, x) =4f7 ( ne *) By. (11.64) 


The integration is to be carried out over the past “null cone” C(x) of the field point (¢, x), 
that is, over those points (7’, x’) such that ’ = t— |x —x’|/vgy (see Figure 11.3). In dealing 
with slow-motion sources, it is conventional to divide the spatial domain into a near zone, 
with r < FR and a far zone or wave zone, with r > R, where R ~ A. ~ t., where t. 
and A. are the characteristic timescale of the source and the characteristic wavelength of 
the radiation, respectively. The vertical world tube D in Figure 11.3 is the interior region 
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Wx) N 


Integration domains for the retarded solution (11.64) of the wave equation (11.61) with the field point in the wave 
zone: C(x) is the past “null” cone (with speed Vga) of the field point x; D is the world tube of the near zone of 
radius R; N(x) is the intersection of D with C(x) and W(x) is the rest of the null cone; 4 (x) is a constant 
retarded-time hypersurface used for calculating multipole moments. 


r < R projected in spacetime. The hypersurface N (x) is that piece of the null cone C within 
the near zone D, and the hypersurface W(x) is the rest of the null cone. Thus the integral 
for the field ¢4 can be written as an integral over NV’ (x) plus an integral over W(x), that is, 


W(t, x) = var(t x) + W(t x). (11.65) 


To obtain the gravitational waveform and the energy flux, we want to evaluate the fields 
in the far wave zone, and retain only the leading contributions in powers of 1/R, where R 
is the distance to the source. In the integral WArs |x’| < R, while R >> R, so we can expand 
|x — x’| in powers of x’/R to obtain 


x! 
We = Ses (:- = + ae ) d’x'+O(R~’), (11.66) 


VgA VgA 


where N = x/R. For dynamical sources, t. ~ r-/v-, where r, is a characteristic scale of 
the material source and y, is a characteristic velocity within the source, and thus for slow- 
motion sources (Vv. < 1), R ~ t. > r,, so the matter source is confined to a region deep 
within the near zone, and 0/0t ~ v./r.. Thus we can expand the retardation in Eq. (11.66) 


in the form 
P 4 I (2)'/ a R ; y\mM Boy =2 
= t— —,x')(N-x) dx'+O(R~), (11.67 
ONG R > m! (Vga )™ Ot M Ved ( ) x ( ) ( ) 


where the integration is now over the constant retarded-time hypersurface M (see 
Figure 11.4). Note that 


Wir = — (Vea) Nia + O(R). (11.68) 


The source 74 will contain contributions proportional to the matter variables p*, p, I, v, 
and so on, which have “compact support,” that is, are nonzero only in finite spatial regions 
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W(x) 


r Fig. 11.4 | Integration domains for the retarded solution (11.64) of the wave equation (11.61) with the field point in the near 
zone: here M (x) is a constant time hypersurface at time f within the near zone. 


where the matter resides. However, at the second iteration and beyond, 74 will also contain 
contributions proportional to products of fields and their derivatives. These contributions 
extend over all spacetime, although they generally decrease with distance from the source. 
Thus in integrating the multipole moments of 74 over the hypersurface M, the integrations 
must be extended all the way to the boundary of the near zone, at 7. 

Because of these field contributions to 7“, one must also include the integral 7),(t, x) 
over the rest of the past null cone, beyond the near zone. It turns out that this integration 
can best be done by changing variables from d°x’ = r’?dr’ dQ’, where d(’ is the element of 
solid angle, to a null-type variable u! = vg4t—r! —|x—x’|, and dx’ = r'(u’, 0’, 6')?du'dQ’ 
(Wiseman and Will, 1991). This change is motivated by the fact that the fields that make up 
r4 in the far zone are themselves retarded fields. Both the inner and the outer integrals will 
depend in general on the radius R where they terminate, but since that radius is chosen arbi- 
trarily, the final result cannot depend on 7. This realization then provides a procedure for 
identifying and retaining in v%,(¢, x) and in w%,(¢, x) only contributions that do not depend 
on R, and throwing away contributions that depend on 7. In fact it was shown explicitly 
that all 72-dependent contributions actually cancel each other in the sum warty, (Pati and 
Will, 2000). In general relativity, it turns out that the contributions of ans first occur 1.5PN 
orders beyond the leading quadrupole approximation; they are known as gravitational- 
wave “tails.” While it is likely that, in an alternative theory of gravity, these outer integrals 
will contribute at a similar PN order, they must be carefully examined to determine the 
precise order at which they will contribute. In fact, in theories of gravity with dipole 
radiation, dipole “tails” from pe, can occur in the waveform at 1PN order, instead of 1.5PN 
order. 

Step 5: Obtain a post-Newtonian expansion of the source 7“. This will be needed both 
for evaluation of the multipole moments of 74 in Eq. (11.67) to the required order, and 
for obtaining the equations of motion to the required order. For this purpose, we need the 
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near-zone solution for the fields y+. For |x — x’| < R, we can carry out the expansion of 
the inner integral war in the form 


co 


A eis / U /jm—1 3307 
=4 Y a |] or (t - d 
YN “> m! (Vea) Jo Eel - 


4 (t,x! 40 
“th - BN) Bey I 4 (t,x')a@x! 
|x — x’ | Vea Ot 


26 
ve Or 


i: T (t,x) |x —x'|d*x' +... (11.69) 


The first term is an instantaneous solution of Poisson’s equation with the source 7“ (called 
a Poisson potential), and, because 0/0t ~ v/r, subsequent terms are of 0.5PN order, 1PN 
order, and so on. One must also evaluate the outer integral y, for fields points in the 
near zone, although generally speaking, this integral begins to contribute only at very high 
post-Newtonian order (4PN in the case of general relativity). 

Depending on the tensorial nature of r“, in the near zone it will typically be made up of 
terms of the form 


eA es p*, p*v’, p, p*U, (VU), UjUx «.. scalar, tensor (11.70) 
p*v!, p*vv/, p* Uv, pvi, VWI. VU, ... vector, . 
if the source is modeled as a fluid with weak internal gravitational fields, or 
wi, malta), Ma(ba)”, ma(ba)U, (VU)*, UjUs,... scalar, tensor (11.71) 
ma(wa)’, ma(a)v'w, ma(s)UW, VVI- VU, ... vector, 


if the source is modeled as a system of compact objects with masses dependent upon fields 
W, with appropriately defined “sensitivities” (see Section 10.3). In the compact-body case, 
each mass is accompanied by a delta function, augmented by a suitable prescription for 
regularizing infinite self-fields effects. Here “scalar” or “tensor” refers to quantities such 
as Poo, Pik» VU, k®, and so on, while “vector” refers to quantities such as po, k/, and so on. The 
wave-zone field is then determined by multipole moments of the source 74 (Eq. (11.67)). 

Various post-Newtonian conservation laws can often be used to simplify the multipole 
moments of 74 in Eq. (11.67). For example, the monopole moment f M p*d°x is constant 
because of the conservation of baryon number of the system. If a collection of terms in a 
multipole moment can be combined to yield a component of the pseudotensorial quantity 
T#” then one can exploit a number of identities that follow from 7“” ,, = 0 to simplify or 
transform — These identities include 


eee eh) ys (11.72a) 


(ee x") oot cate + ae = 5 (Te x) og 


a ae 
Ae! 
2 
4 [( (Orly iyh) | — 7 lO — riot | 3 (11.72b) 


pik = ; Ga 00 Ayck i + PA 4 ; (7 x/x*) (11.72c) 


p- 
Note that, because we are dealing with semiconservative theories of gravity, 7“” is not 
necessarily symmetric, so we have retained contributions of the antisymmetric parts of 
THY where necessary. 
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Step 6: Determine the energy, momentum and angular momentum loss rate in terms of w. 
Generally speaking, this involves using the pseudotensorial quantity 7“” and its associated 
conservation law 7“” ,, = 0. If one defines a quantity P° according to 


i / THB x, (11.73) 
then it is straightforward to show that 
dPt 
a = - § HBS, (11.74) 


where the surface integral is to be carried out in the far wave zone. Since the matter energy- 
momentum tensor 7“” has compact support, it will vanish in the far zone, and thus only 
the contributions from the fields 74 to 7’ will be needed. And because aS; = RnidQ 
and we wish to take the limit as R —> 00, only the R~? contributions to 7” are relevant. 
Making use of the far-zone behavior of Eq. (11.68), one can express the rate of change of 
P® and P’ in the generic form 


dP® R? f ra 

= LP MM adpaO, 

dt 327 eT 

dP/ R fie auc 

aE Bop § MM bbgNlaQ, (11.75) 
AB 


where N‘ denotes a product of ZL unit vectors, and 6” denotes a product of M Kronecker 
deltas. The Z and M indices are suitably contracted with indices on the 74s. In integrating 
over solid angles, one makes use of the identities 


4n L/2 
§ nian _ Jw [o“/? + sym(q)] Leven, (11.76) 
0 Lodd, 


where sym(q) denotes all distinct terms obtained by permuting indices on the Kronecker 
deltas, with g = (¢ — 1)! ! denoting the total number of terms. 
Similarly one can define an angular-momentum quantity, given by 


jras / MOP y, (11.77) 
whose rate of change is given by 
rik ; : 
— =-2 } TP —2 § Vn Ps, . (11.78) 


The first term in Eq. (11.78) is not related to gravitational radiation; it is the result of the fact 
that some theories of gravity are only semi conservative (see Section 4.4 and Eq. (4.70)). 
For the surface term, the presence of the factor x/ oc R means that one must calculate the 
subleading R~? terms in 7” in order to find a nontrivial result (the R~? contributions give 
zero because of an odd number of unit vectors). This means obtaining the subleading R~* 
terms in the 74. Section 12.2.4 of PW gives a detailed derivation within general relativity. 

In some theories of gravity, T,,, is not unique; scalar-tensor theories are examples (Lee, 
1974). Therefore one must determine the physical meaning of the quantity P®. In general 
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relativity, it is known to be the Kepler measured mass, or the Bondi mass of the system 
(when evaluated at spacelike infinity, it is also the Arnowitt-Deser-Misner (ADM) mass). 
In scalar-tensor theories, different choices for T,,,, lead to different masses: tensor mass, 
scalar mass or active (Kepler) mass. In the end, it is not so important which type of “mass” 
is represented by P°, but rather what changes in observable parameters are induced by the 
change dP°/dt. Therefore one must carefully calculate P° to an appropriate PN order in 
order to relate it to quantities such as orbital period, which are measurable. 

Step 7: Apply to binary systems. We consider a system made of two bodies that are small 
compared to their separations (d < r); that is, we ignore all tidal interactions between 
them. We may thus treat each body’s structure as static and spherical in its own rest frame. 
We then follow the procedure of Section 6.3: for a given element of matter in body a, we 
write x = X, + X and v = v,, where 


1 1 
— [0 (1411-30) xés, 
Ma Ja 2 

* 1_ B 
Ma = | p bade ax, 


Xa 


m4 
C= ae 
eee p* B 
v= ax. (11.79) 
a jee] 


We note that m, is conserved to post-Newtonian order, as long as tidal forces are neglected. 
The full Newtonian potential U for spherical bodies is given by 


5 m fos 
U(x, t) =U+ 2 > __ vinside body a 
bee |x — X;| 
Mp : 
a » —_.,, x outside bodya. (11.80) 
b lx — X;| 


In the compact-body approach, each body is located at the singularity of the delta function, 
internal gravitational fields are regularized appropriately and the masses are expanded in 


terms of sensitivities. Only the external potential is present, where now m, = ma( (0), 
where po represents the asymptotic values of the relevant auxiliary fields. 
Step 8: Apply balance equations to obtain the orbital evolution. Let us suppose that 7“” 


has been chosen so that, to the appropriate orders, P® and J/ = (1/2)e/?4?4, are given by 


1 1 ‘a 
Pp? = > Mg + sma » Gmam, 4 O(mge) : 


2 r 
i ba ab 


J= JY) mgXq X voll + O(6)], (11.81) 


where ray = |x — xp| and where G = Gjp is the parameter defined in the modified EIH 
Lagrangian, Eq. (10.29). It depends in general on the parameters of the theory and on the 
sensitivities. 


262 


Gravitational Radiation 


For a binary system we may evaluate the orbital terms in Eq. (11.81) using the 
appropriate Keplerian equations, with the result 


2 
p® = m— + O(me’), 


J = nmJ/Gmp h{1 + O(6)], (11.82) 


where m = m, + m2, 7 = mym7/ m?, a is the semimajor axis, related to the binary orbital 
period by (P,/27)* = a*/Gm, and p = a(1 — e”). 

In the emission of gravitational radiation whose source is the orbital motion, we will 
treat the quantities m, and mz as constant because of our neglect of tidal forces and internal 
motions. Then one can obtain a direct relation between dP° /dt and dJ/dt and variations in 
a, e, and P. Thus, for example, if the resulting energy loss rate for a binary system is given 
by Eq. (11.59), then one can average over an orbit to obtain 


0 5 4 
Wa et (=) Fle) — xxpn?S°G7! (=) Ge), (11.83) 
dt 5 a 3 a 


1 7 1 1 1 
F(e) = pt =e) E (1 + xe + i) - 5 hoe (1 + °)| ; 


G(e) = (1-&)7>/? (1 re ¢) (11.84) 


where 


Combining Eqs. (11.82) and (11.83) with the relation between orbital period P, and 
semimajor axis a, we can obtain an expression for the time-derivative P,, 


5/3 
Peete (=") F(e) — 2nnpnS? (=) Ge), (11.85) 
5 Py Py 


where M is called the “chirp mass,” given by 
M =7°/3m. (11.86) 


If one has calculated dP°/dt and dJ/dt to higher PN orders, then it will be necessary to 
evaluate P® and J to a corresponding PN order. 

Although the use of energy and angular momentum balance to infer the behavior of the 
source from flux formulae in the far zone was specifically criticized by Ehlers et al. (1976), 
it has actually been validated within general relativity and scalar-tensor theories to modest 
PN orders beyond the quadrupole formula via first-principles calculations of the radiation- 
reaction forces (Pati and Will, 2002; Will, 2005; Mirshekari and Will, 2013). Unfortunately, 
to reach a given PN order beyond the quadrupole formula, it is much easier to obtain flux 
results than it is to obtain radiation-reaction results. Therefore, in order to work at the high- 
PN orders required to analyze the inspiral of compact binaries, one is forced to adopt the 
balance assumption. 

This completes our recipe for analyzing the generation of gravitational waves. To 
illustrate the method, we will now focus on two metric theories, general relativity 
and scalar-tensor theories where gravitational radiation from compact binaries has been 
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analized in considerable detail. Partial results are also known for some of the alternative 
theories discussed in Chapter 5. 


11.5.2 General relativity 


In Section 5.2.2 we described the post-Minkowskian approach to solving Einstein’s 
equations iteratively, using the Landau-Lifshitz formulation of the equations. We write 
the wave equation for the field h®°, Eq. (5.19) in the form 


Ae = —16r7% , (11.87) 


where 7°? = (—g)(T°* + 1° + #¢°), and where we have set G = 1. Note that, because 
of the harmonic gauge condition he ¢ = 0, we have that T°? g = 0. Using Eq. (11.67), 
and recalling that ve = 1 in general relativity, we obtain the multipole expansion in the far 
wave zone, 


co 


4 1 /0\" 
ap _ * ap = / Ly \n BI =) 
ly; 2a (5) [ie (t—R,x’)(N-x')"dx'+O(R~%). (11.88) 


Because of the harmonic gauge condition combined with Eq. (11.68), we can show that 


hh = NehX, + O(R~*) [modulo a constant], 
Le = NjNhiy + O(R~”) [modulo a constant], (11.89) 


therefore, we need to determine only the jk components. 

For the first iteration nit , the source rit is the special relativistic stress tensor for 
matter 7) ia p*v/v‘, However, the ae of motion that is compatible with the gauge 
eouicen at the first iteration is bee rn = 0, which implies no gravitational interaction 
between the two bodies. huhemoc an inspection of the Landau-Lifshitz pseudotensor 
i reveals that it contributes terms to 7“ of the form U;U,, where U is the near zone 
peaviational potential; these terms are of the same order as the purely kinetic terms in rk ‘ 
and therefore they must be included. This is why two iterations of the relaxed equations are 
essential. However, it turns out that these details can be averted by exploiting the identity 
in Eq. (11.72b). At the lowest multipole order, we have that 


; 4 ; 
i= Rf rh Rx) ey 


2 a 00 PS. PP IK Bd 
= Rae T (t—- Rx) x x"d x’ . (11.90) 
M 


In converting the divergence terms in Eq. (11.72b) to surface integrals at the boundary 
of M, we must verify that there are no 7-independent contributions. Any R-dependent 
contributions can be discarded, as we know that they will be cancelled by corresponding 
contributions from the outer integral He To lowest PN order, 7°° = p*, thus we obtain 
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e. 2¢ 
ae = aa | p* (t—R,x’) x4x*#x' [1 + O(6)] 
M 
Wo 
= 27K —R), (11.91) 


where I/* = >, maxi, x* is the quadrupole moment tensor of the source. This is the well- 
known quadrupole formula for the gravitational waveform. To complete the solution we 
must add the term hi, from the integration over the rest of the past null cone, however this 
is known not to contribute until 1.5PN order beyond the leading quadrupole term. 

It can be demonstrated that the total mass-energy of an isolated system is given by 


E=P= / aa ee (11.92) 
Then, because of the condition 7“? 3 = 0, we find that 
a @&x=— | dx =— gp TIN, Ra (11.93) 
dt — —~ : : 


where dQ) is the element of solid angle. By examining carefully the structure of the Landau- 
Lifshitz and harmonic pseudotensors of Eqs. (5.16) and (5.20) in the far wave zone, and 
taking Eqs. (11.68) and (11.89) into account, we obtain the energy loss rate 


dE R a 
ao age § hit hi dQ, (11.94) 
where the subscript TT denotes the “transverse tracefree” part of h/*, given by 
1 
ht = Ge a. Je (11.95) 


where P/* is the “projection tensor” given by 
Pi = §/* — NINE (11.96) 


We substitute Eq. (11.91) into (11.94) and integrate over solid angles, making use of the 
identities (11.76). The result is the quadrupole formula for the energy loss rate, Eq. (11.53). 
Substituting the quadrupole moment tensor for a binary system, //* = pux/x* + mX/X*, 
taking the time derivatives, and using the Newtonian equation of motion dv/dt = —mx/r’, 
yields Eq. (11.54) for the waveform and Eq. (11.55) for the energy loss rate. Thus, for 
general relativity, «; = 12, K2 = 11, and Kp = 0. 

Analogous formulae for the angular momentum loss rate and the rate of change of linear 
momentum of the source can be obtained: 


dy R? mn mp7n, 1 i g 
Tne f | HRI — Sie amt] a 


dt 16m on 
= etme) 7 (11.97a) 
dPko 
= jinn jin Q 
dt 32 f may 
2 (4) ‘mn 16 
_ 2p bhinn) 7 hn tt 11. 
63 ~ 45° J ete) 
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where J” and J”? are the mass octupole and current quadrupole moments, given to lowest 
PN order by 


inn = [oseres, 
Prag” / pxiv Pr Px. (11.98) 


Note that, at purely quadrupole order, the linear momentum change vanishes, because the 
number of unit vectors N/ in the angular integral will always be odd. To get the final 
expression in Eq. (11.97b) it is necessary to compute the 0.5PN corrections to the waveform 
h/*, resulting in the appearance of the higher-order multipole moments of Eq. (11.98). Fora 
binary system at lowest PN order, these loss rates become 

a= 73" (7) BP 3 422), 

dP 8 


ae gst ae [» (50? - 387 +8") — in (55-457 +12™)], (11.99) 


where A = x x vand A = (m2 — m,)/m. 

Substituting Keplerian formulae for the orbital variables and averaging over an orbit, we 
can obtain the orbit-averaged dE /dt and dJ/dt. Recalling that for the orbit, E = —nm?/2a 
and J = nm,/mp, where p = a(1 — e”), we obtain the rate of change of a and e. 
The results are 


a 
dJ 32 5 m\7/2 
da 15 m(=) (2) 


= H 11.10 
dt 15 ‘a\a (e), ( ) 
where 
73 37 
FA=a(—S HF inak e a 2 
()=(-ey (14 Fes el), 
G(e) = (1—e?)~? (: + :°) ; 
121 
H(e) = (1—e”)- 9? (1+ ee"). 11.101 
(= (1-2) 97 (14 se (11.101) 
These equations for da/dt and de/dt can be combined and integrated to yield 
‘i 2 \ 12/19 74 2\ (1+ 121e2/304 870/2299 
—=|[— 5 : (11.102) 
ao e 1-2 1 + 121e5/304 


where ao and eo are the initial values. Note that gravitational radiation causes the orbit 
to both shrink and circularize. For eg « 1, the evolution is given approximately by 
e & eo(a/ao)!*/'?. From da/dt we can obtain an expression for the rate of change of 
Py, given by 
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; 1927 (2 oe 
Raw ( ) Fle), (11.103) 
5 Pe 


where M is the chirp mass given by Eq. (11.86). Notice that by making precise 
measurements of the phase ®(r) = 27 [ ‘ft’)dt’ of either the orbit or the gravitational 
waves (for which f = 2f, = 2/P, for the dominant component) as a function of the 
frequency, one in effect measures the chirp mass of the system. The frequency of the orbit 
and of any emitted gravitational waves increases with time, varying as fy ~ (t — to) ~>/8 
for circular orbits, where fo is the merger time when the binary separation vanishes. The 
semimajor axis shrinks according to a ~ (t — fo)!/4, and thus the waveform amplitude 
increases ash ~ a! ~ (t— rh 4 In acoustics, a signal of increasing amplitude and 
frequency is known as a “chirp”, hence the name “chirp mass” for the quantity M that 
controls the process. 

From Eq. (11.91), the gravitational waveform for a binary system is given, to lowest PN 
order by (we drop the subscript V) 


hi = “um a - a , 
r 
4 
+ sus (1 — nin) , (11.104) 


where the second line is the circular orbit limit, and where r v, m and X are given by 
Eqs. (6.73). We now choose the basis in Figure 6.1 so that the X axis is along the line of 
nodes and also choose w = 0 for the pericenter angle. ee the fact that, for N in the Z 
direction, the projection tensor can be written P/* = eye, + e}e4, we can show that the 
transverse tracefree components nh, Eq. (11.95) become 


itty = (eel — eet) hy + (ehel + efek) hx, (11.105) 
where 


hy = a a [(A- ex)? — (A+ ey)” — (n- ey)? + (n- ey)”] 


2 
- = + cos”t) cos2¢, 


hy = ~= [(A- ex)(A- ev) — (a ex) (ev) | 


4nm 
= tym —cosvsin2¢. (11.106) 
It can also be shown that 
1d jk 
age (11.107) 
and thus the amplitudes in Eqs. (11.42) and (11.48) are given by 


Rojox = — 


2 
AL= la (1 + cos’s) cos 2¢, 
4 2 
Ay = — ne cosisin2d. (11.108) 
r 


with As = Ay Ay2 = AL 0. 
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The foregoing discussion has been for the Newtonian waveforms and fluxes, but results 
are now known to many post-Newtonian orders beyond these lowest order formula. As an 
illustration, we show the rate of change of energy for a circular orbit through 2PN order: 


=r (2) [2 (E+ br) v4 (2) 


my? (293383 380 
! ! 11.10 
z) ( 9072 9 ") | oy) 


The IPN corrections to the waveform and energy flux were calculated by Wagoner and 
Will (1976). The 1.5PN term 47(m/a)?/? is the contribution of tails, the result of scattering 
of the outgoing gravitational wave off the background curved spacetime produced by the 
source. This and the 2PN terms were calculated by Blanchet et al. (1995a, 1995b) and 
Will and Wiseman (1996). Through pioneering work by Blanchet and collaborators, these 
results are now known through 3.5PN order (see Blanchet (2014) for an up-to-date review). 
The leading effects of spins of the bodies on the waveform and fluxes were first calculated 
by Kidder et al. (1993) and Kidder (1995) (see Section 6.7 for the leading contributions 
to the equations of motion). Numerous higher-order spin contributions have also been 
calculated (see Blanchet (2014) for a review and references). 


11.5.3 Scalar-tensor theories 


Because we are interested in gravitational waves from systems containing compact bodies, 
we will combine the “relaxed” or Landau-Lifshitz formulation of the scalar-tensor field 
equations developed in Section 5.3.2 with the modified EIH formalism described in 
Section 10.3.2. 

The wave equation for the field h° in harmonic gauge has a form parallel to that of 
Eq. (11.87), 


hee = —160nGF , (11.110) 


where 7°° = (—g)(T°P + a + #2? + 7°) and G = G/do (see Section 5.3.2 
for definitions). Recall that quantities with tildes are defined using the conformally 
transformed metric $1 = (¢/¢0)gy. Because ae = 0 by virtue of the gauge condition 
on ho , we can follow the same steps as for general relativity, Eqs. (11.88)-(11.90) to find, 


in the far zone, that 
2 
hit = Tal eGR x) ee (11.111) 
M 


To lowest PN order, 7°° = p* = )>, m,6?(x — Xz), where m, is the mass of each body 
evaluated for the asymptotic value po of the scalar field, and we obtain 


jie 20 im Ry, (11.112) 


where J is the quadrupole moment tensor of the source (see Eq. (11.91)), and we have 
used units in which Gioday = G/(1 — ¢) = 1. We now turn to the scalar wave equation 
in its relaxed, compact body form, Eq. (10.55b). To lowest PN order, it is straightforward 
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to show that the equation takes the form OW = —87Cp*(1 — 2s), where the sensitivity s 
arises from the derivative 0T/0¢. Using the multipole expansion (11.88) in the far wave 
zone, and keeping the monopole and dipole terms, we obtain 


2 
Wa Dell 20) (Nott), (11.113) 


Dropping the constant term and specializing to a two-body system at rest at the origin of 
coordinates, we obtain 


v= se im(s2 — si)N-v. (11.114) 
Thus we see the presence of dipole gravitational radiation, generated by the differ- 
ence in sensitivity or gravitational binding energy, between the two bodies. We also 
see that Y ~ (m/R)v, while the leading quadrupole contribution for “tensor” waves is 
hik ~w (m/R)v’. According to our convention for labelling the orders of gravitational waves, 
for compact bodies, the dipole radiation contribution to WV is of —0.5PN order. Thus, 
in order to derive the full gravitational waveform to an order that is consistent with 
the Newtonian-order radiation of general relativity, we must also include the quadrupole 
moment contributions to W, as well as the 1PN corrections to the monopole moment, which 
could have time-varying contributions at order (m/R)v*. Furthermore, as we will see later, 
the energy flux in scalar-tensor theory is proportional to a sum of terms involving (ai; y 
and (W)*. This means that the leading dipole radiation term in W will contribute to the 
flux at —1PN order compared to quadrupole radiation. Therefore, in addition to including 
the O[(m/R)v’] terms in UV, we must also include (m/R)v> terms, i.e. contributions of 
0.5PN order, because in (W.9)*, there will be a cross-term between the —0.5PN dipole terms 
and the +-0.5PN term, leading to a OPN, or Newtonian-order term, at the same order as the 
quadrupole flux. Thus we must include the octupole moment in the scalar waveform as 
well as PN corrections to the dipole moment. In addition, in calculating the time derivatives 
of W for the energy flux, we must use 1PN equations of motion in the —0.5PN dipole term, 
as these will lead to contributions at +0.5PN order. 
For a compact binary system, the waveforms h/* and W are given to the required orders 
(Lang, 2014, 2015) by 


ee ag git (ot a nin) (11.115) 


where G = 1 — € + ¢(1 — 28,)(1 — 252) [see Eq. (10.65)], and 


aginst [Y_ospn + Yorn + Y+ospn] » (11.116) 
where 
WV_ospn = 2S_N-¥, 
Wopn = — (S; + AS_) Ea —(N-v + 


-28[s, - 2 (5,8, +5-3.)], 
r ye 
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U.05PN = : (AS; + (1 — 2n)S_) |. n)’(3¢+N-n—7N-v) + 2(v-¥)°| 


+ Sinn (SAS, +38.) —(W-») (AS, + 7S_) 
— Sw») (AS, + 3 —4n)S-) 
1 > Z - 2 
+422 (W.y— AN) =[A(SiBe +8 Ba) + (S284 S28) 
(11.117) 
where 7, 3, and 3_ are defined in Eqs. (10.71), and 
SSG Fly a1), 
Se0 Vil =a ei, (11.118) 


Note that 7 = —2¢(S7. — S*). 

Because of the presence of the scalar field, one can formulate a number of different 
conservation laws in scalar-tensor theory (Nutku, 1969; Lee, 1974). In the formulation 
used here, the conservation law Ta = 0 is related to the “tensor mass-energy” of the 
system, P®, given by 


Ph f és. (11.119) 
One can then show that 
dP® a 
eo ee 
ag sik 7 jk 
=~ jim = § [Ftc olttng + (6 + 4u0) Vo 0] a2. (11.120) 
co Tv 


Evaluating Eq. (11.119) to 1PN order, we find that Pe is given precisely by Eq. (11.81) for 
an N-body system and by Eq. (11.82) for a binary system. This is the total mass energy 
or the inertial mass of the system. As we discovered in Section 6.3, Eq. (6.59), in these 
theories, the inertial mass is not the same as the active gravitational, or Kepler-measured 
mass. Lee (1974) formulated a conservation law for the scalar mass Pe and from that 
defined a conservation law for the active or Kepler mass P{ = P!. + Pg. Even though the 
Kepler mass is considered the defining mass for an isolated system, the tensor mass can be 
directly related via Eq. (11.82) to measurable quantities such as the semimajor axis, which 
determines the orbital period and the frequencies of gravitational waves. Thus we will use 
the expression (11.120). Substituting Eqs. (11.115), (11.116), and (11.117) into (11.120) 
and integrating over solid angle, we obtain Eq. (11.59), with S = S_ and 


K1 = 12+ 57-5687 (3 +7428, +2AB_) 
S_ B+ “2. S4 By + s-) 
a I}, 


+ 10¢S_ ( 
7 


+ 10CAS_ ( 
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45 . = i 

Ko = 11+ 77-408, — SCS? [17+ 67 + + 884 + 8AZ_] 

S_By rset S484 =S7.) 
: 7 


+ 90¢S_ ( 
(S4 ts.) 
y ’ 


30CAS_ ( 


120¢ 
Kp = 4C. (11.121) 


These results are in complete agreement with the total energy flux to —1PN and OPN orders, 
as calculated by Damour and Esposito-Farése (1992a) in the DEF formulation of scalar- 
tensor theories, and with the energy loss derived from radiation reaction terms in the 
equations of motion (Mirshekari and Will, 2013). They disagree with the flux formula of 
Will and Zaglauer (1989) (see also Alsing et al. (2012)), who failed to take into account PN 
corrections to the dipole term induced by PN corrections in the equations of motion, and 
the dipole-octupole cross term in the scalar energy flux. To date, the analogous expressions 
for angular momentum change have not been calculated. 

From Eggs. (5.36), (5.40) and (5.41), we obtain the leading contributions to the physical 
metric in the far zone, 


I fone ai 
goo = —1+ 5 (A +i + 20) , 


80j = —h%, 
2, iT 7 z 
Sik = Sip th + <5, (h —h™ — 20) , (11.122 
>] J 9 J 
from which we obtain 
1@-, 1 wu 
Rook = — 3 aahitr + 5 (Sik — NiNe) (11.123) 


Substituting Eqs. (11.115), (11.116), and (11.117) and dropping constant terms, we obtain 
the amplitudes for a circular orbit 


Free te (1 + cos?z) cos2¢, 
4, e20= sis cosvsin2¢, 
As = 4Cnm|g'/?S_] (a) sinucos d 
+ 26m" gi (S, + AS_)] sin2ucos2¢. (11.124) 


It is worth noting that, for binary black holes, s; = s2 = 1/2, and thus S_ = S; = 0, 
G=1-¢,7 = 8 = 0 (also 4/7 = 0). Thus x; = 12, kp = 11, kp = O, and if we 
replace the mass m with m/(1—C), the waveform and energy flux become identical to those 
of general relativity, just as we found for the PN equations of motion in Section 10.3.2. 
For a black-hole neutron-star binary system, the energy flux parameters are G = | — ¢, 
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K, = 12—150/4, ky = 11—50(17+7)/4 and in the dipole flux term, KpS2 = QO, where 
Q=¢(1—¢)71(1 — 2s,)’, and s, is the sensitivity of the neutron star. 

These waveforms and fluxes were extended to 2PN order for the tensor waves (Lang, 
2014) and to 1PN order for the scalar waves (Lang, 2015). Completing the calculation of 
the scalar flux to 2PN order requires calculating the monopole and dipole moments of the 
scalar field to 3PN order, and knowing the equations of motion to 3PN order! 


11.5.4 Other theories 


Very few other theories of gravity have had their gravitational-wave predictions for 
compact binaries analyzed to comparable levels. Yagi et al. (2014a) found the Newtonian- 
order energy flux for compact binaries in Einstein-ther and Khronometric theories. They 
did not derive all the sensitivity-dependent contributions in the vector-field sector required 
for a complete and consistent solution, but they argued that the neglected terms would be 
negligible for the applications to binary pulsar data that they carried out. 

Many of the theories that were studied during the 1970s as either alternatives to general 
relativity or as “straw-man” examples, were found to predict dipole gravitational radiation. 
Even worse, in most cases the dipole radiation, and sometimes also the quadrupole 
radiation carried negative energy (Will, 1977). For example, Rosen’s bimetric theory 
predicted &) = —21/2, K2 = —23/2 and kp = —20/3, and the theory suffered a fatal 
blow when data showed that the orbit of the binary pulsar was shrinking, not expanding 
(Will and Eardley, 1977). 

Because of the real possibility of carrying out tests of alternative theories of gravity using 
recent detections of gravitational waves, the effort to calculate the gravitational radiation 
predictions in a range of alternative theories will surely intensify. 


Strong-Field and Dynamical Tests 


of Relativistic Gravity 


We now take the theoretical frameworks for discussing compact body dynamics and 
gravitational radiation that we developed in the preceding two chapters, and apply them 
to study current and future tests of general relativity in the strong-field and dynamical 
regimes. This effort began with the discovery of binary pulsars beginning in 1974, and in 
Section 12.1, we will discuss these remarkable systems and their implications for testing 
relativistic gravity. The detection of gravitational waves from an inspiraling binary black 
hole system in 2015 brought to fruition the ability to test gravitational theories in the 
strong-field dynamical regime using the gravitational waves themselves. In Section 12.2 
we will describe how information about the sources of the waves and about the underlying 
theory can be extracted from the detected signal, and will discuss the tests that have been 
carried out since 2015. Another kind of test of strong-field gravity involves exploring the 
spacetime near otherwise stationary compact objects using probes such as stars, small black 
holes and gas. Many of these tests are still works in progress; we will give some examples 
in Section 12.3. Finally, Section 12.4 will close this chapter with a brief description of 
cosmological tests. 


12.1 Binary Pulsars 
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The summer of 1974 was an eventful one for Joseph Taylor and Russell Hulse. Using the 
Arecibo radio telescope in Puerto Rico, they had spent the time engaged in a systematic 
survey for new pulsars. During that survey, they detected 50 pulsars, of which 40 were 
not previously known, and made a variety of observations, including measurements of 
their pulse periods to an accuracy of one microsecond. But one of these pulsars, denoted 
B1913+16,' was peculiar: besides having a pulsation period of 59 ms, shorter than that 
of any known pulsar except the one in the Crab Nebula, it also defied any attempts to 
measure its period to +1 jus, by making “apparent period changes of up to 80 ys, from 
day to day, and sometimes by as much as 8 jus, over 5 minutes” (Hulse and Taylor, 1975). 
Such behavior is uncharacteristic of pulsars, and Hulse and Taylor rapidly concluded that 
the observed period changes were the result of Doppler shifts due to orbital motion of 
the pulsar about a companion (for a popular account of the day-by-day detective work 


' At the time of the discovery, pulsars were denoted PSR, followed by the right ascension and declination. PSR 
was ultimately replaced by “B”, for pulsars whose coordinates were referred to the 1950.0 epoch. Most pulsars 
discovered since 1993 are labeled with “J”, have coordinates referred to the 2000.0 epoch, and include minutes 
in the declination. We will use the current conventions. 
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involved see Will (1986)). By the end of September, 1974, Hulse and Taylor had obtained 
an accurate “velocity curve” of this “single line spectroscopic binary.” The velocity curve 
was a plot of apparent period of the pulsar as a function of time. By a detailed fit of this 
curve to a Keplerian two-body orbit, they obtained the following elements of the orbit 
of the system: K, the semiamplitude of the variation of the radial velocity of the pulsar; 
Py, the period of the binary orbit; e, the eccentricity of the orbit; wo, the longitude of 
periastron at a chosen epoch (September 1974); a; sinv, the projected semimajor axis of 
the pulsar orbit, where v is the inclination of the orbit relative to the plane of the sky; and 
fi = (m7 sinv)> /(m; +mz)’, the mass function, where m, and my are the mass of the pulsar 
and companion, respectively. In addition, they obtained the “rest” period P, of the pulsar, 
corrected for orbital Doppler shifts at a chosen epoch. 

However, at the end of September 1974, the observers switched to an observation 
technique that yielded vastly improved accuracy (Taylor et al., 1976). That technique 
measures the absolute arrival times of pulses (as opposed to the period, or the difference 
between adjacent pulses) and compares those times to arrival times predicted using the 
best available pulsar and orbit parameters. The parameters are then improved by means of 
a least-squares analysis of the arrival-time residuals. With this method, it proved possible 
to keep track of the precise phase of the pulsar over intervals as long as six months 
between observations. This was partially responsible for the improvement in accuracy. 
Other improvements over the years included the sophisticated use of multi-wavelength 
data to suppress the effects of interstellar dispersion on the radio signals, the use of GPS 
time transfer to improve the timing precision, and a major upgrade of instrumentation at 
the Arecibo radio telescope between 1993 and 1997. The results of these analyses using 
data reported in 2016 are shown in column 3 of Table 12.1 (Weisberg and Huang, 2016); 
in Section 12.1.1 we will define the Keplerian and post-Keplerian parameters. 

The discovery of B1913+16 caused considerable excitement in the relativity community, 
because it was realized that the system could provide a new laboratory for studying 
relativistic gravity. Post-Newtonian orbital effects would have magnitudes of order 
vi ~ Ki ~ 5 x 1077, or m/r ~ fi/a;sine ~ 3 x 1077, a factor of ten larger 
than the corresponding quantities for Mercury’s orbit, and the shortness of the orbital 
period (~ 8 hours) would amplify any secular effect such as the periastron shift. This 
expectation was confirmed by the announcement in December 1974 (Taylor, 1975) that 
the periastron shift had been measured to be 4.0 + 1.5° yr~! (compare with Mercury’s 43 
arcseconds per century!), implying (see below) that the total mass of the system was about 
2.8 Mo. Moreover, the system appeared to be a “clean” laboratory, unaffected by complex 
astrophysical processes such as mass transfer. 

The pulsar radio signal was never eclipsed by the companion, placing limits on the 
geometrical size of the companion, and the dispersion of the pulsed radio signal showed 
little change over an orbit, indicating an absence of dense plasma in the system, as 
would occur if there were mass transfer from the companion onto the pulsar. These 
data effectively ruled out a main-sequence star as a companion: although such a star 
could conceivably fit the geometrical constraints placed by the eclipse and dispersion 
measurements, it would produce an enormous periastron shift (+5000 ° yr~') generated 
by tidal deformations due to the pulsar’s gravitational field. Other candidates for the 
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Table 12.1 Arrival-time parameters for binary pulsars, and their values in B1913+16. Numbers in 
parentheses denote errors in the last digit. Data taken from Weisberg and Huang (2016). 


Parameter Symbol (units) Value in B1913+16 
(i) Astrometric and pulsar parameters 

Right ascension a 19"15™27.599942(3) 
Declination 6 16°06'27."3868(5) 
Pulsar period P» (ms) 59.030003217813(11) 
Derivative of period Py 8.6183(3) x 1078 
(ii) Keplerian parameters 

Projected semimajor axis a) sine (s) 2.341776(2) 
Eccentricity e 0.6171340(4) 
Orbital period Py (day) 0.322997448918(3) 
Longitude of periastron wo(°) 292.54450(8) 

Julian date of periastron passage To (MJD) 52144.90097849(3) 
(iii) Post-Keplerian parameters 

Mean rate of periastron advance (w) (Cyr) 4.226585(4) 
Redshift/time dilation y' (ms) 4.307(4) 

Derivative of orbital period Py (107"”) —2.423(1) 

Range of Shapiro delay r (us) Qioae? 

Shape of Shapiro delay s= sint ONG Sian 


companion that were considered early on were a helium main-sequence star, a white dwarf, 
a neutron star and a black hole. Any of these would be consistent with the evolutionary 
models for binary systems of two massive stars that were popular at the time. In these 
models, one massive star evolves more rapidly, undergoing a supernova explosion and 
leaving a neutron star remnant. Mass transfer from the companion star serves to spin up 
the neutron star to its present 59 ms rotation period (this is the so-called pulsar recycling 
model, believed to be responsible for the class of millisecond pulsars). Subsequently, the 
massive companion star evolved rapidly, possibly undergoing its own explosion, leaving 
one of the four remnants listed earlier. 

The first two companion candidates, the helium star and the white dwarf, fell out of 
favor because of the complete absence of evidence for orbital perturbations that would be 
induced by tidal or rotational deformation effects. A black hole companion was disfavored 
when the observed pericenter advance indicated that the total mass of the system was about 
2.8 Ma; most evolutionary scenarios leading to black holes, and data on X-ray binaries 
containing black holes suggest that black holes in such systems are significantly more 
massive than 1.4 Mo. A neutron star remnant from a supernova explosion of the companion 
star was viewed as the most compatible with the near equality of the two masses, with 
the large orbital eccentricity (mass loss from the second supernova explosion can easily 
lead to disruption of the system), and with the utter orbital “cleanliness” of the system. 
In this scenario, the companion neutron star is the younger of the two. No evidence was 
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ever found for pulsed radiation from the companion, so it is either a pulsar whose signal 
does not intersect the Earth, or more likely is a dead pulsar, having spun down to a stage 
where it is no longer able to generate the magnetic fields required to emit radio waves. The 
main pulsar is an old, weakly magnetized, recycled pulsar, with very weak radio emission 
(indeed it was barely above the detection threshold set by Hulse in 1974), and a very low 
spin-down rate. 

Within weeks of the discovery, it was recognized that the system might be a testing 
ground for gravitational radiation damping (Wagoner, 1975; Damour and Ruffini, 1974). 
The observable effect of this damping is a secular decrease in the period of the orbit (see 
Section 11.5). However, the timescale for this change, according to general relativity, is so 
long (~ 10° yr) that it was thought that 10-15 years of data would be needed to detect 
it. However, with improved data acquisition equipment and continued ability to “keep in 
phase” with the pulsar with the arrival-time method, Taylor and his collaborators surpassed 
all expectations, and in December 1978 they announced a measurement of the rate of 
change of the orbital period in an amount consistent with the prediction of gravitational 
radiation damping in general relativity (Taylor et al., 1979; Taylor and McCulloch, 1980). 
In 1993, Hulse and Taylor were awarded the Nobel Prize in Physics for this discovery. 

In 1990, two more binary pulsars were detected, one eerily similar to B1913+16 in its 
orbital parameters. Today, more than 220 binary pulsars have been detected, of which 70 
have orbital periods shorter than one day (the shorter the period, the more relativistic the 
system). These include the famous “double pulsar,’ where, for a time, pulses from both 
neutron stars were also detected; several pulsars with a white dwarf companion; and most 
recently, a pulsar in a triple system, orbited by two white dwarfs. There are also a number of 
pulsars with planets revolving around them; while these are fascinating in their own right, 
they are less interesting for relativity. We will describe the more important denizens of this 
zoo of binary pulsars and their implications for testing relativistic gravity in Section 12.1.2, 
but first we turn to the arrival-time analysis for studying these systems. 


72.1.1 Arrival-time analysis for binary pulsars 


The analysis and interpretation of data from binary pulsars is based on an “arrival-time” 
framework, originally carried out by Blandford and Teukolsky (1976) and extended by 
Epstein (1977), Haugan (1985), Damour and Deruelle (1986) and Damour and Taylor 
(1992). Here we present a simplified version that illustrates the basic parameters that can 
be measured, and how they can be interpreted in order to test theories of gravity. 

We begin by setting up a suitable coordinate system (see Figure 12.1). We choose 
quasi-Cartesian coordinates (tf, x) in which the physical metric is of post-Newtonian order 
everywhere, except in the neighborhood and interior of the pulsar and its companion (if 
the latter is also compact), and is asymptotically flat. The origin of the coordinate system 
coincides with a suitably chosen center of mass of the binary system, assumed to be at rest 
with respect to the solar system. The reference X — Y plane is the plane of the sky, with the 
X-axis conventionally pointing toward the North celestial pole. The observer is chosen 
to be at great distance on the negative Z-axis. The orbit of the pulsar is characterized 
by the standard osculating orbit elements described in Section 6.4.2, and illustrated in 
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To observer 
Z Orbit of companion 
or= Ue « 


ae 


Orbits of a binary pulsar system. 


Figure 12.1. However, instead of using the definitions of Eq. (6.73) based on the true 
anomaly f, we use definitions based on the so-called “eccentric anomaly” u (see PW, 
section 3.2.4 for detailed discussion), which is more useful in an arrival-time analysis. 
It is directly related to time by the equation 


u—esinu =n(t— Tp), (12.1) 


where n is the “mean anomaly,” defined by n = (Gm/a?)!/2 = 27/Py, where a is the 
semimajor axis, Pp is the orbital period of the binary, and 7p is the time of pericenter 
passage. We include the modified EIH parameter G, because that parameter (including 
possible contributions from the sensitivities of the compact bodies) will govern the quasi- 
Newtonian limit used to describe the osculating elements in a given theory. The true and 
eccentric anomalies are related by the equations 


_ /l—esi 
cos f= — , sinf= ites ate (12.2) 
1 —ecosu 1 —ecosu 


With the center of mass X = (mx, + mx2)/m chosen to be at rest at the origin, the orbits 
of the pulsar (1) and companion (2) are given by 
m2 m, 
x}=—-X, %=-—x, (12.3) 
m m 


where x = x; — X2 is given by 
x = al (cosu—eep + v1 — sinueo| . (12.4) 


where ep and eg are unit vectors defined by Eq. (8.29); they point in the direction of the 
pericenter of body 1 and perpendicular to that vector within the orbital plane (Figure 12.1). 
Note that the unit vector in the direction of the orbital angular momentum is given by 
h=ep x €g. The distance between the two bodies is given by r = a(1 — ecosu). 
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Any perturbation of the two-body system will induce periodic and secular changes in the 
orbit elements that can be calculated using the Lagrange planetary equations (6.74). 

We next consider the emission of the radio signals by the pulsar. Let 7 be proper time as 
measured by a hypothetical clock in an inertial frame that is momentarily comoving with 
respect to the pulsar. The time of the Nth rotation of the pulsar is given in terms of the 
rotation frequency v of the neutron star by 


1 1 
N=N+vr4 sr eer eae (12.5) 


where Np is an arbitrary integer, and » = dv /dr|,—0, v = d’v/dr?|,=0. We will ignore the 
possibility of discontinuous jumps, or “glitches,” in the rotation frequency of the pulsar, or 
of drifts of the phase of the radio beam with respect to the neutron star’s rotation phase. 
These are complicated issues that are still not fully understood. We ultimately wish to 
determine the arrival time of the Nth pulse on Earth. 

Outside the pulsar and its companion, the metric in our chosen coordinate system 
is given by the modified EIH metric, Eq. (10.28). Because we are interested in the 
propagation of the radio signal away from the system, we will ignore the possibility of large 
strong-field corrections to the metric in the close vicinity of the pulsar or its companion. 
The main result of such effects will be either to add a constant term in the emission time 
formula (12.5) that can be absorbed into the arbitrary value of No, or to multiply terms 
by a constant factor, such as the redshift at the surface of the neutron star, that can be 
absorbed into the unknown intrinsic value of v. Modulo such factors, proper time 7 in the 
local comoving inertial frame, evaluated at the pulsar’s center of mass, can be related to 
coordinate time ¢ by 

dr = dt 1 - aj - mm + o(e)| : (12.6) 
where we have dropped the constant contribution of the pulsar’s gravitational potential 
at the point in the inertial frame chosen to be the point of “emission” of the signal, 
aym/|Xem — X1|, and have ignored the difference between the velocity of the emission 
point and the pulsar’s center of mass x;. The two correction terms in Eq. (12.6) are the 
gravitational redshift and the second-order Doppler shift, or time dilation. 

Using Eqs. (12.1) and (12.4), it is simple to show that v7 = G(m3/m)(2/r — 1/a), and 
thus that 


dr ,_ mm a3 + Gm2/m | Gm . (12.7) 
dt a 1-—ecosu 2ma 


Integrating with respect to ¢, we obtain 


a [! i (a3 | Gx) | te die (03 ! Gg) tng taat, C98) 
a 2m a m/ 20 

where u and uo are the values of the eccentric anomaly at t = ¢, and t = 0, respectively. 

The factor multiplying ¢, can be absorbed into the definition of the frequency v, and the 

constant term involving uo can be dropped. Although these constants may actually undergo 

secular or periodic variations in time due to orbital perturbations or other effects, these will 
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be small perturbations of post-Newtonian corrections, and will have negligible effect. Thus 
we obtain 
m2 * m2 Py F 
og (a3 ~ =)  sinu. (12.9) 
a m/ 20 
After emission, the pulsar signal travels along a null geodesic. We can therefore use the 
method in Sections 6.1 and 7.2 to calculate the coordinate time taken for the signal to travel 
from the pulsar to the solar system barycenter xo, with the result 


2 : 2r larr 
larr — le = |X0 (Carr) Xx (te) | a (03 me mg = E . 


(te) +n: X(te) er 


where ro = |xo|, 2 = X0/ro and we have used the fact that ro >> 7). The second term in 
Eq. (12.10) is the Shapiro time delay of the pulsar signal in the gravitational field of the 
companion. The time delay due to the pulsar’s own field is constant to the required accuracy 
and has been dropped. The effect of the companion’s motion during the propagation of the 
signal across the orbit is a higher post-Newtonian effect, and thus has been ignored. 

In practice, one must take into account the fact that the measured arrival time is that 
at the Earth and not at the barycenter of the solar system, and will therefore be affected 
by the Earth’s position in its orbit and by its own gravitational redshift and Doppler-shift 
corrections. In fact, it is the effect of the Earth’s orbital position on the arrival times that 
permits accurate determinations of the pulsar’s position on the sky. It is also necessary to 
take into account the effects of interstellar dispersion on the radio signal. These effects can 
be handled in a standard manner and will not be treated here. 

Now, because rp >> 7, we may write 


Ix (tar) — X1(te)| = ro(tar) —X1(te) m+ O(r/ro). (12.11) 


Combining Eqs. (12.10) and (12.11), resetting the arrival time by the constant offset ro, 
evaluating r and x, at far, instead of at ¢,, and dropping higher-order terms, we obtain 


te = tare + (X1 (tar) 2) (1 + 1 (tan) «2) — Atg (torr) 5 (12.12) 


where Afg is the Shapiro term in Eq. (12.10). 

We can now combine Eqs. (12.9) and (12.12) and substitute the result into Eq. (12.5) 
to relate t:, to the pulse number NV. Here we must include another effect that can occur 
in theories of gravity that violate SEP (Eardley, 1975), whereby the local gravitational 
constant at the location of the pulsar may depend on the gravitational potential of the 
companion, that is, 


i= (1 = nm) (12.13) 


where 7* is a parameter that depends on the theory, and possibly on the sensitivities of the 
two bodies. As G, varies during the orbital motion, the structure of the pulsar, its moment 
of inertia, and thus its intrinsic rotation rate will vary, according to 


Av Al AG, 


yh LG, 


+ M2 
=-Kymi, (12.14) 
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where #1) is the sensitivity of the moment of inertia of body 1 to variations in G;. Thus 
Eq. (12.5) should be rewritten 


1 1 
N=Nytor+ f Avar ar gue ses (12.15) 
where to the necessary order, and modulo constants, 
P 
[aver = —veayns 2 Ze sinu, (12.16) 
a 20 


and where we have not applied a similar correction to the smaller terms involving v and v. 
Substituting Eqs. (12.9), (12.12), and (12.16) into (12.15), and dropping the constant No 
we obtain the timing formula 


1 1 
tar = V~'N — Ap(u) — Ag(u) + As(u) sy Nt - giv iN +..., (12.17) 
where the three “delay” terms are given by 


Ar(u) = xF(e, w, u) [1 + xF(e, W, u)| ; 
Ag(u) = ¥' sinu, 
As(u) = 2rIn[l — ecosu — sF(e,w,u)] , (12.18) 


where 


: m, 
x =a,sinu = —asin, 
m 


Ps 1/3 fies 
“i Sem (5.) (a3 + G7 + rym) : 


1 * * 
5 (03 +73), 
s= sine. (12.19) 


— 


The function F(e, w, uv) is given by 
F(e,w,u) = sinw(cosu—e) + cosw(1 —e”)!/? sinu, (12.20) 
where F = dF /dt, and wu is related to tar by 
u—esinu = N(tar — To). (12.21) 


Equation (12.17) gives tar; in terms of the pulse number N, a set of orbital and relativistic 
parameters, and the intrinsic spin parameters v, / and v. From an initial guess for the 
values of these parameters, a prediction for the arrival time of the Nth pulse can be made. 
The difference between the predicted arrival time and the observed arrival time is then 
used to correct the parameters using a suitable parameter estimation technique, such as 
least-squares. 

The term Ag in Eq. (12.17) is called the “Roemer” delay. It is simply the shift in arrival 
time caused by the change in the pulsar’s location relative to the center of mass. The 
amplitude is controlled by the parameter x, the projected semimajor axis of the pulsar on 
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the plane of the sky. The evolution of this term with respect to f,,, is governed by the orbital 
period Py, the eccentricity e, the pericenter angle w and the pericenter time 7) . Because 
this is a purely geometric effect coupled with Newtonian gravity, these five parameters are 
known as the “Keplerian” parameters of the system, as listed in Table 12.1. This is the 
time-domain version of the standard technique for determining orbits in Newtonian single- 
line spectroscopic binary systems whereby one measures the Doppler shift of the pulse 
period or spectral line. Notice that the combination 


2 ; 
fi=x (=) = gos. (12.22) 


m 


is the standard “mass function” of spectroscopic binaries. 

The Roemer delay is the dominant correction, of order r/7 ~ v ~ €!/. The Einstein and 
Shapiro terms are of order ¢. Thus, if the orbit elements a, e, w, » experience any secular 
changes, they will be observable in the Roemer term, if at all (a change in the angle of 
nodes 2 will be unobservable, as it merely rotates the system about the line of sight). It is 
conventional to define 


w =wot (w)(t—to)+..., 


Py = Pro + sPolt —to)+...5 (12.23) 
where wo and Pyo and their time derivatives are defined at a chosen epoch fg. Note that 
the factor 1/2 in the expression for P, comes from the formal definition of P, in terms 
of the semimajor axis a. One then substitutes Eqs. (12.23) into Ap and includes w and 
P, as parameters to be estimated. The parameters wW and P, are two of the post-Keplerian 
parameters listed in Table 12.1. 

In principle one could include secular variations in e and 1; such variations have 
recently been detected in B1913+16 (Weisberg and Huang, 2016), resulting from a spin- 
orbit induced precession of the orbital plane. Furthermore, as we saw in Sections 8.2 
and 8.4, and will reiterate below, it has been possible to search for periodic varia- 
tions in these parameters in some systems, and to place important limits on violations 
of SEP. 

The term Ag is the Einstein term, also called the Redshift/time dilation term, with 
amplitude 7’. The parameter y’ (not to be confused with the pPpN parameter ) is another 
post-Keplerian parameter. Note that, once e and Py are determined from the Roemer term, 
it depends only on the masses of the bodies and possibly on their sensitivities, through a3, 
G, Ky, and 75. 

The final relativistic term is the Shapiro delay As, dependent upon the “range” parameter 
r, and the “shape” parameter s. The former depends on the mass of the companion and 
possibly on sensitivities, while the latter is simply sinv. 

The final two post-Keplerian parameters in our discussion are (w) and Py, whose 
predicted forms are given by Egs. (10.49) and (11.85). Box 12.1 summarizes these 
predictions for the post-Keplerian parameters. 


12.1 Binary Pulsars 


Predictions for post-Keplerian parameters 


Here we summarize the predictions for the post-Keplerian parameters in alternative theories of gravity. 
Redshift/time dilation: 


Py ve mM 
4/ =em (4) (a3 af Gis ob rant) ; 
Shapiro delay range: 
1 ae oc 
Hie 5 (0 + 7; )m. 
Shapiro delay shape: 
S= Siz. 


Rate of advance of periastron 


ie 67 27Gm a -9 
) = a ( a") Po 


Derivative of orbital period 


5/3 
| ca (=e) F(e) — denon S? (= ) G(e). (12.24) 
5 Po iy 
The coefficients a5 and +; appear in the post-Newtonian metric of body 2 [Eqs. (10.28)], while the parameter 
ny and the sensitivity +) refer to the effect of the field of body 2 on the moment of inertia of body 1 
[Egs. (12.13)—(12.16)]. The quantity G appears in the quasi Newtonian limit of the modified EIH equations 
of motion, a = —Gmx/r’, while P is given by Eq. (10.48). The functions F(e) and G(e) are given by 
Eq. (11.84); «pis the dipole radiation parameter of the theory, and S is related to the difference in sensitivities 

between the two bodies. 
In general relativity, the post-Keplerian parameters simplify to 


aia, 
§ = She, 
: _ 67 27m ay 
Ne Py(1 = e-) Py d 
: 1920 (2 ae 
poe neal igi F(e), (12.25) 
5 Py 


where 


ee sl 
F(e) = (1—e)~7? (: ee Til at =) : (12.26) 
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The formulae for P, include gravitational radiation contributions only through 
quadrupole order. They ignore other sources of energy loss, such as tidal dissipation, mass 
loss from the system, energy loss from the pulsar emission or via magnetic interactions 
between the two bodies. To date, there has not been an example of a binary pulsar system 
where these mechanisms come close to the energy loss via gravitational radiation. If such 
an system were found, it would very likely be deemed too “dirty” to provide quantitative 
tests of relativistic gravity, though it might well yield interesting physics of other types. 

However there is a contribution to P, that cannot be ignored. If the center of mass of the 
binary system is accelerating relative to that of the solar system, then both the orbital and 
pulsar periods will change at a rate given by 

Fm mya = [P(e], (12.27) 
where v and a are the relative velocity and acceleration, respectively, between the binary 
system and the solar system. The first term is the projection of the acceleration along the 
line of sight, while the second, called the Shklovsk1i effect, represents the effect of variation 
of the line of sight. If we assume that the binary system (b) and the solar system (©) are 
on circular orbits around the galaxy with angular velocities , and Qo, distances from the 
galactic center 7) and re, and longitudes relative to the galactic center dp and de, then 
Eq. (12.27) takes the form 

1 78 a, Gey? VO (cos - ue sin?) (12.28) 
Py Py 5 ro re > ‘ 


where ¢ = dp — de. We will see that this effect is significant for some binary pulsars, such 
as B1913+16, but not important for others, such as the double pulsar, depending on their 
distance from the solar system and precise location in the galaxy. 


12.1.2 A zoo of binary pulsars 


Here we describe some of the most interesting binary pulsar systems from the point of 
view of testing relativistic gravity. For the most part, we will discuss the implications of 
the system for general relativity. In Section 12.1.3, we will discuss bounds placed by binary 
pulsar observations on various alternative theories of gravity. 


The Hulse-Taylor binary pulsar B1913+16 


The measured values of (i), 7/ and P, shown in Table 12.1 provide three constraints on the 
two masses m, and mp), given by Eqs. (12.25) in Box 12.1. It is conventional to plot these 
three constraints, including their uncertainties, on an mm, — mz plane. If general relativity 
is correct, they must overlap at a single point, within the uncertainties. The plot is shown 
in the left panel of Figure 12.2. The (w) constraint fixes the total mass of the system to be 
2.8284 Mo, accurate to about a part in 10°; this is the black line in Figure 12.2, with the 
width of the line much larger than the actual uncertainty. The 7’ constraint fixes the quantity 
(m2 /m)(1+m2/m); this is the dark grey line in Figure 12.2. From the intersection of these 
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Left: m; — m; plot for B1913+16. Right: Time of periastron passage as a function of time from 1975 to 2013. 
The curve is the prediction of general relativity for P, given by Eq. (12.25); the points are the measured values 
with error bars multiplied by a factor of 400. The large gap in data during the middle 1990s occurred during the 
upgrade of the Arecibo Radio Telescope. Image reproduced with permission from Weisberg and Huang (2016), 
copyright by AAS. 


two constraints, we obtain the values for the individual masses, m, = 1.438 + 0.001 Mo 
and mz = 1.390 + 0.001 Mo (the error is dominated by the uncertainty in 7’). 

The P, constraint fixes the chirp mass of the system M = 1° /m: because m is known, 
this constrains 7. However, in this case, the galactic acceleration effect is important. 
Using data on the location and proper motion of the pulsar, combined with the best 
information available on galactic rotation; the current value of this effect is ca = 
—(0.025 + 0.004) x 10~!?. Subtracting this from the measured post-Keplerian parameter 
P, shown in Table 12.1 gives the corrected value Pe" = — (2.398 + 0.004) x 107, 
This value and its uncertainty are used to plot the dP,/dt curve in Figure 12.2. It is 
the hyperbolic curve shown in the inset in Figure 12.2 which ranges from zero to three 
solar masses, and the light grey band parallel to the (w) constraint in the blow-up of the 
intersection point “a,” which ranges over about 0.05 Me. The three curves overlap at a 
common point, within the uncertainties. 

Another way to check the agreement with general relativity is to use the masses inferred 
from the intersection of the (w) and y’ constraints along with the expression for Py in 
Eq. (12.25) to predict the value pe = —(2.40263 + 0.00005) x 107!*. This agrees with 
the measured value after the galactic correction, in other words, 


pcoor 


GR = 0.9983 + 0.0016. (12.29) 


b 


Although the uncertainties in the measured post-Keplerian parameter P, continue to 
decrease, in part because of the decrease of statistical errors with observation time 7 as 
T~3/2, the uncertainties in the parameters that go into the galactic correction are now 
the limiting factor in the accuracy of the test of gravitational wave damping in general 
relativity. In fact, if one assumes that general relativity is correct, then the binary pulsar is 
providing improved data on the galactic rotation curve. 
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A third way to display the agreement with general relativity is to compare the observed 
phase of the orbit with a theoretical template phase as a function of time. If P, varies 
slowly in time, then to first order in a Taylor expansion, the orbital phase is given by 
®,(t) = 2rt/Pyo — mPyot? /P2, +... . The time of periastron passage 7p is given by 
®(T7)) = 27N, where N is an integer. Consequently, the periastron time will not grow 
linearly with NV. Thus the cumulative difference between periastron time 7) and NPyo, the 
quantities actually measured in practice, should vary according to 

Ty — NP yp 2° NP ew 2b 


= 12.30 
oP, 2 Pro a 


The right panel of Figure 12.2 shows the comparison between the measured times and the 
predicted times of periastron passage. Even after a nearly five-year gap in observations 
while the Arecibo radio telescope was undergoing a major upgrade, the measured 
periastron times landed right on top of the predicted curve. 

The consistency among the constraints provides a test of the assumption that the 
two bodies behave as “point” masses, without complicated tidal effects, obeying the 
general relativistic equations of motion including gravitational radiation. This supports 
the evolutionary model whereby the pulsar is an old recycled pulsar and the companion is 
a young but dead pulsar. It is also a test of the adherency of general relativity to the SEP in 
the presence of strong gravity, in that the highly relativistic internal structure of the neutron 
stars does not influence their orbital motion. 

Observations indicate that the pulse profile is varying with time (Kramer, 1998; Weis- 
berg and Taylor, 2002), which suggests that the pulsar is undergoing geodetic precession on 
a 300-year timescale as it moves through the curved spacetime generated by its companion 
(see Section 9.1.1). The amount is consistent with GR, assuming that the pulsar’s spin 
is suitably misaligned with the orbital angular momentum. Unfortunately, the evidence 
suggests that the pulsar beam may precess out of our line of sight by 2025. 

The precession of the pulsar’s spin is accompanied by a precession of the orbital plane, 
since the total angular momentum of the system is constant up to the changes induced 
by gravitational radiation reaction. One consequence of this is that the orbital inclination 
has increased enough that, with the aid of improved measurement accuracy, the Shapiro 
delay has become measurable (see Table 12.1). Another consequence is that variations in 
the projected semimajor axis x and eccentricity e must now be included in the analysis, 
leading to somewhat larger errors in the redshift/time dilation parameters +’ than those 
presented in earlier analyses (see, e.g., Weisberg et al. (2010)). 


The “double” pulsar J0737-3039A,B 


This binary pulsar system, discovered by Burgay et al. (2003), was already remark- 
able for its extraordinarily short orbital period (0.1 days) and large periastron advance 
(16.8995° yr—'), but then the companion was also detected as a pulsar (Lyne et al., 2004). 
Because two projected semimajor axes could be measured, the mass ratio was obtained 
directly and to high precision from the ratio of the values of the Keplerian parameters 
xX, = a, sine and x2 = ay sinu, since x; /x2 = m2/m, modulo post-Newtonian corrections 
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Mass-mass plots for the double pulsar. Image courtesy of Michael Kramer. 


(see Table 12.2). In Figure 12.3, the line labeled “R” is that mass ratio, and the white 
area is the allowed region for the masses, set by the measurement of the two Keplerian 
mass functions fj = x}(27/P,)? and f = x3(27/P,)* combined with the condition 
that |sinz| < 1. Then, the individual masses were obtained by combining the mass 
ratio with the periastron advance, assuming general relativity to be valid. The results 
are my = 1.3381 + 0.0007 Mo and mg = 1.2489 + 0.0007 Mo, where A denotes the 
primary (first) pulsar. From these values, one finds that the orbit is nearly edge-on, with 
sint = 0.9997, a value which is completely consistent with that inferred from the Shapiro 
delay shape parameter s = sin. In fact, the five measured post-Keplerian parameters plus 
the ratio of the projected semimajor axes give six constraints on the masses. As seen in 
Figure 12.3, all six overlap within their measurement errors (Kramer et al., 2006). The fact 
that the overlap region, shown in detail in the inset, is so close to the vertex of the allowed 
region is another indication that the orbit is nearly edge-on. Note that Figure 12.3 is based 
on more recent data than that quoted in Kramer et al. (2006), described in this discussion 
and listed in Table 12.2. 

Because of the location of the system, galactic proper-motion effects play a significantly 
smaller role in the interpretation of Py measurements than they did in B1913+16; this 
and the reduced effect of interstellar dispersion means that the accuracy of measuring the 
gravitational-wave damping will eventually beat that from the Hulse-Taylor system. It may 
ultimately be necessary for the data analysis to include 2PN corrections, for example in the 
periastron advance. 
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Table 12.2 Parameters of other binary pulsars. Values for Py, include corrections for galactic kinematic 
effects. Numbers in parentheses denote errors in the last digit. See the text for references. 


Parameter J0737—3039(A,B) J1738+0333 J1141-6545 
(i) Keplerian 

ay Sint (S) 1.415032(1)/1.516(2) 0.343429 13(2) 1.858922(6) 
e 0.0877775(9) (3.441.1) x 1077 0.171884(2) 
Py (day) 0.10225156248(5) 0.354790739872(1) 0.1976509593 (1) 
(ii) Post-Keplerian 

(w) (° yr) 16.8995(7) 5.3096(4) 
7’ (ms) 0.386(3) 0.77(1) 

P, (107) —1.25(2) —0.0259(32) —0.401(25) 
r (ss) 6.2(3) 

s= sine 0.9997(4) 


The geodetic precession of pulsar B’s spin axis has also been measured by monitoring 
changes in the patterns of eclipses of the signal from pulsar A as it passes through the 
magnetosphere surrounding pulsar B, with a result in agreement with general relativity 
to about 13 percent (Breton et al., 2008). The constraint on the masses from that effect, 
denoted Qgo is also shown in Figure 12.3. In fact, pulsar B has precessed so much that its 
signal no longer sweeps by the Earth, so it has gone “silent.” For a recent overview of the 
double pulsar, see Burgay (2012). 


J1738+-0333: A white-dwarf companion 


This is an ultra-low-eccentricity, 8.5-hour period system in which the white-dwarf 
companion is bright enough to permit detailed spectroscopy, allowing the companion mass 
to be determined directly to be 0.181 Me. The mass ratio is determined from Doppler 
shifts of the pulsar signal and of the spectral lines of the companion, giving the pulsar 
mass 1.46 Mo. Ten years of observation of the system yielded both a measurement of the 
apparent orbital period decay, and enough information about its parallax and proper motion 
to account for the substantial galactic effect to give a value of the intrinsic period decay of 
Py = (—25.9 + 3.2) x 107 in agreement with the predicted effect of general relativity 
(Freire et al., 2012). But because of the asymmetry in the sensitivities of the bodies in 
the system, the result also places a significant bound on the existence of dipole radiation, 
predicted by many alternative theories of gravity. Data from this system were also used 
to place a tight bound on the PPN parameter a (see Section 12.1.3 for discussion of these 
tests of alternative theories). 


J1141--6545: A white-dwarf companion 


This system is similar in some ways to the Hulse-Taylor binary: short orbital period 
(0.2 days), significant orbital eccentricity (0.172), rapid periastron advance (5.3° yr—!) and 
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massive components (m, = 1.27 40.01 Mo, me = 1.02 + 0.01 Ma). The key difference is 
that the companion is again a white dwarf. The intrinsic P, has been measured in agreement 
with general relativity to about 6 percent, again placing limits on dipole gravitational 
radiation (Bhat et al., 2008). 


J0348+0432: The most massive neutron star 


Discovered in 2011 (Lynch et al., 2013; Antoniadis et al., 2013), this is another neutron- 
star white-dwarf system, in a very short-period (0.1 day), low-eccentricity (2 x 10~°) orbit. 
Timing of the neutron star and spectroscopy of the white dwarf have led to mass values 
of 0.172 Me for the white dwarf and 2.01 + 0.04 Me for the pulsar, making it the most 
massive accurately measured neutron star yet. This supported an earlier discovery of a 
2 Mo pulsar (Demorest et al., 2010); such large masses rule out a number of heretofore 
viable soft equations of state for nuclear matter, assuming general relativity to be correct. 
The orbit period decay agrees with the general relativistic prediction within 20 percent and 
is expected to improve steadily with time. 


J0337+1715: Two white-dwarf companions 


This remarkable system was reported by Ransom et al. (2014). It consists of a 2.73 mil- 
lisecond pulsar of 1.4378(13) Mo, with extremely good timing precision, accompanied by 
two white dwarfs in coplanar circular orbits. The inner white dwarf (m = 0.19751(15) Mo) 
has an orbital period of 1.629 days, with e = 6.9177(2) x 107‘, and the outer white dwarf 
(m = 0.4101(3) Mo) has a period of 327.26 days, with e = 3.5356196(4) x 1077. This is 
an ideal system for testing the Nordtvedt effect in the strong-field regime. Here the inner 
system is the analogue of the Earth-Moon system, and the outer white dwarf plays the role 
of the Sun. Because the outer semimajor axis is about | /3 of an astronomical unit, the basic 
driving perturbation is comparable to that provided by the Sun on the Earth—-Moon system. 
However, the self-gravitational binding energy per unit mass (sensitivity) of the neutron 
star is almost a billion times larger than that of the Earth, greatly amplifying the potential 
size of the Nordtvedt effect. In 2018, Archibald et al (Nature, in press) reported a bound 
of 2.6 parts per million on any difference in acceleration between the neutron star and the 
white dwarf, representing an improvement over lunar laser ranging of about a factor of 10. 


Other binary pulsars 


Two of the earliest binary pulsars, B1534+12 and B2127+11C, discovered in 1990, failed 
to live up to their early promise despite being similar to the Hulse-Taylor system in most 
respects (both were believed to be double neutron-star systems). The main reason was the 
significant uncertainty in the kinematic effect on Py of local accelerations, that of the galaxy 
in the case of B1534+12, and that of the host globular cluster in the case of B2127+11C. 

A class of wide-orbit binary millisecond pulsar (WBMSP) systems, containing a pulsar 
and a white dwarf in low-eccentricity orbits has been used to place interesting bounds on 
the Nordtvedt effect. See Section 8.1 for a discussion. 
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12.1.3 Tests of alternative theories 


Soon after the discovery of the binary pulsar B1913+16 it was widely hailed as a new 
testing ground for relativistic gravitational effects. As we have seen in the case of general 
relativity, in most respects, the system has lived up to, indeed exceeded, the early expecta- 
tions. In many ways, the double pulsar topped the Hulse-Taylor binary in this regard. 

In another respect, however, B1913+16 only partially lived up to its promise, namely as 
a direct testing ground for alternative theories of gravity. The origin of this promise was 
the discovery (Eardley, 1975; Will, 1977) that alternative theories of gravity generically 
predict the emission of dipole gravitational radiation from binary star systems. As one 
fulfillment of this promise, Will and Eardley (1977) worked out in detail the effects of 
dipole gravitational radiation in the bimetric theory of Rosen (1974), and when the first 
observation of the decrease of the orbital period was announced in 1979, the Rosen theory 
suffered a fatal blow. A wide class of alternative theories of that period also failed the 
binary pulsar test because of dipole gravitational radiation (Will, 1977). 

On the other hand, the early observations of B1913+16 already indicated that, in general 
relativity, the masses of the two bodies were nearly equal, so that, in theories of gravity 
that are in some sense “close” to general relativity, dipole gravitational radiation would not 
be a strong effect, because of the apparent symmetry of the system. The Rosen theory, and 
others like it, are not “close” to general relativity, except in their predictions for the weak- 
field, slow-motion regime of the solar system. When relativistic neutron stars are present, 
theories like these can predict strong effects on the motion of the bodies resulting from their 
internal highly relativistic gravitational structure (violations of SEP). As a consequence, 
the masses inferred from observations of the periastron shift and 7’ may be significantly 
different from those inferred using general relativity, and may be different from each other, 
leading to strong dipole gravitational radiation damping. 

By contrast, the Brans-Dicke theory is close to general relativity, roughly speaking 
within 1 /wgp of the predictions of the latter, for large values of the coupling constant wgp. 
Thus, despite the presence of dipole gravitational radiation, the Hulse-Taylor binary pulsar 
provides only a weak test of pure Brans-Dicke theory, not competitive with solar-system 
tests. 

However, the discovery of binary pulsar systems with a white dwarf companion, such 
as J1738+0333, J1141-6545, and J0348+0432 has made it possible to perform strong 
tests of the existence of dipole radiation. This is because such systems are necessarily 
asymmetrical, since the gravitational binding energy per unit mass, or sensitivity of white 
dwarfs is of order 10~*, much less than that of the neutron star. Already, significant bounds 
have been placed on dipole radiation using J1738+0333 and J1141-6545. 

Because the gravitational-radiation and strong-field properties of alternative theories 
of gravity can be quite different from those of general relativity and each other, it is 
difficult to parametrize these aspects of the theories in the manner of the ppN framework. 
In addition, because of the generic violation of the Strong Equivalence Principle in these 
theories, the results can be very sensitive to the equation of state and mass of the neutron 
star(s) in the system. In the end, there is no way around having to analyze every theory in 
turn. On the other hand, because of their relative simplicity, scalar-tensor theories provide 
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an illustration of the essential effects, and so we will begin by discussing binary pulsars 
within this class of theories. 


Scalar-tensor theories 


We combine the results of Section 10.3.2, notably Eqs. (10.48), (10.49), (10.59), and 
(10.65) to obtain the metric coefficients a* and y* and the periastron advance rate 
(ww). We then combine Eqs. (11.84), (11.85), and (11.121) to obtain Py. Recalling that 
Py = 2n(a3/Gm)'/? where 


G=1—¢€+C(1 — 2s,)(1 — 2s), (12.31) 


and ¢ = (4 + 2wo)~!, we obtain expressions for the post-Keplerian parameters in scalar- 
tensor theories: 


p, \"3 es 
7! = em (=) (1 — 2¢s2 + rs + 2¢(1 — 252)(1 +2A)r)) : 


r=m(1—¢), 
s= sinu, 
. 67 2nGm ae Pesce = 3 
= am ( Py ) [1+ 57-8. aay], 
; 1920 (2nGM\>? 2 
i= 1 ( e ) Fle) — 8nCnS? (=) Ge). (12.32) 


In the limit ¢ — 0, we recover general relativity, and all structure dependence disappears. 
The first term in Py is the combined effect of quadrupole and monopole gravitational 
radiation, post-Newtonian corrections to dipole radiation, and a dipole-octupole coupling 
term, all contributing at OPN order, while the second term is the effect of dipole radiation, 
contributing at the dominant —1PN order. 

Unfortunately, because of the near equality of neutron star masses in typical double 
neutron star binary pulsars, dipole radiation is somewhat suppressed, and the bounds 
obtained are typically not competitive with the Cassini bound on scalar-tensor theories. 
Figure 12.4 uses the ag-$9 parametrization of scalar-tensor theories of Damour and 
Esposito-Farése, to display bounds from a variety of systems and experiments. The bounds 
from the three binary neutron star systems B1913+16, JO737—3039, and B1534+12 are 
not close to being competitive with the Cassini bound on apo, or with lunar laser ranging 
(LLR) bounds, except for those generalized scalar-tensor theories with Gy < 0, where 
the strong gravity of the neutron stars induces spontaneous scalarization effects (Damour 
and Esposito-Farése, 1998). Recall that ay = (3 + 2wo)~!/? = [¢/(1 — ¢)]'/? and 
Bo = 2uhbo/(3 + 2wo)? = 2A/(1 — C). 

On the other hand, a binary pulsar system with dissimilar objects, such as one with 
a white dwarf or black hole companion, is a more promising testing ground for dipole 
radiation. As a result, the neutron-star-white-dwarf systems J1141—6545 and J1738+0333 
yield much more stringent bounds. Indeed, the latter system surpasses the Cassini bound 
for 69 > 1 and Go < —2, and is close to that bound for the pure Brans-Dicke case {y = 0 
(Freire et al., 2012). 
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Bounds on scalar-tensor theories from solar-system and binary-pulsar tests. Image reproduced with permission 
from Freire et al. (2012), copyright by Oxford University Press. 


It is worth pointing out that the bounds displayed in Figure 12.4 have been calculated 
using a specific choice of scalar-tensor theory, in which the function A(y) is given by 


A(e) = exp| aaly— v0) + 56(9 — 0) (12.33) 


where av and /39 are arbitrary parameters, and (yp is the asymptotic value of the scalar field. 
In the notation for scalar tensor theories used here, this theory corresponds to the choice 


eg 1 

2 © 2(a2 — Bolnd)’ 
where ¢9 = A(o)~* = 1. The parameters ¢ and 2 are given by ¢ = aj/(1 + a4), and 
= Bo/2(1 + af). It is useful to note that the pPpN parameter combination 48 — y — 3, 
which governs the Nordtvedt effect and other violations of SEP in the post-Newtonian limit 
is given by 


w(¢) = (12.34) 


204 Bo 
48 —7y—3=2¢(1+2A)=—4, [14 12.35 
=a C( ) 4 ( a) (12.35) 
which vanishes when {9 ~ —1. This partially explains the distorted “spike” in the curve 
for lunar laser ranging, where the violation of SEP is suppressed, and no useful bound can 
be obtained. Something similar is occurring in the other curves, albeit complicated by the 
strong gravity effects taking place in the neutron stars. 
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The bounds shown in Figure 12.4 were calculated using a polytropic equation of state, 
which tends to give lower maximum masses for neutron stars than do more realistic 
equations of state. It is not known how much the use of stiffer equations of state would 
alter the bounds on this class of theories. 


Other theories 


Bounds on various versions of TeVeS theories have also been established, with the tightest 
constraints again coming from neutron-star-white-dwarf binaries (Freire et al., 2012); in 
the case of TeVeS, the theory naturally predicts y = 1| in the post-Newtonian limit, so the 
bounds from Cassini are irrelevant here. 

Strong constraints on the Einstein-A:ther and Khronometric theories were also set using 
binary pulsar measurements, exploiting both gravitational-wave damping data, and data 
related to preferred-frame effects (Yagi et al., 2014a, 2014b). For Einstein-Ether theory, 
for example, the approximate bounds were c, < 0.03 and c_ < 0.003. 


12.2 Inspiralling Compact Binaries and Gravitational Waves 
——>>>E>————— ESSE 


A new era for testing general relativity began on September 14, 2015, with the first 
detection of gravitational waves by LIGO (Abbott et al., 2016c). The signal, denoted 
GW150914, was the final burst of gravitational waves from the inspiral and merger of 
two black holes. Even though the signal was detectable above the noise for only 0.2 
seconds, three phases could be clearly delineated: a “chirp” phase of increasing amplitude 
and frequency, corresponding to the late inspiral phase, a “merger” phase, where the two 
black holes were becoming one, and a “ringdown” phase, where the final, perturbed black 
emitted exponentially damped radiation from its quasinormal mode oscillations before 
settling down to a stationary black hole (see Figure 12.5 for an illustration). Analysis of 
the chirp radiation showed that the black holes had masses 26 and 39 Mj, while analysis 
of the ringdown radiation revealed that the final black hole had a mass of 62 Me and 
a spin parameter x ~ 0.67. The three solar masses converted to energy during the 
process corresponded to an approximate luminosity of 3.6 x 10°° erg s~!, larger than the 
luminosity of all the stars in the observable universe combined. 

Already with the first detection, important tests of general relativity were carried out. 
One test checked the consistency between the observed signal and theoretical template 
waveforms based on combining post-Newtonian theory with numerical relativity within 
general relativity. Another test placed a bound on the mass of the graviton that improved 
upon the bound derived from solar system dynamics. The fourth detection, GW170814, 
which included the Virgo detector (Abbott et al., 2017c), resulted in an interesting test of 
the polarization content of the gravitational waves. The most recent detection GW170817 
was a binary neutron-star inspiral (Abbott et al., 2017d). This observation was notable 
because it was also observed across the entire electromagnetic spectrum, with major 
implications for gamma-ray bursts, the neutron-star equation of state, and the synthesis of 
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Schematic gravitational wave from binary inspiral, showing the inspiral, merger and ringdown parts of the 
waveform. 


the heaviest elements (Abbott et al., 2017e). It also produced a very precise direct bound 
on the difference between the speeds of gravitational waves and light. When the analysis 
is complete, it may also contribute additional tests of relativistic gravity. 

In this section we will lay the groundwork for discussing gravitational waves from 
binary inspiral within general relativity and scalar-tensor theory, and then will discuss a 
few parametrized frameworks for analysing alternative theories. Finally we will describe 
the results from specific tests carried out so far. This field is truly just beginning, and so it 
is likely that much of what is presented here will soon be out of date! 


12.2.1 Compact binary inspiral in general relativity 


To keep the discussion simple, we will assume that general relativity is correct and that the 
two compact bodies are in quasi-circular orbits, which means orbits that are circular apart 
from the adiabatic inspiral induced by gravitational radiation. For compact binaries that 
have evolved in isolation for considerable time, this turns out to be an excellent approxi- 
mation. As we learned in Section 11.5.2, the orbital eccentricity decreases with semimajor 
axis approximately as e = e9(a/ao)!9/'? = eo(P/Ppo)!?/!8. Thus, for example, by the 
time the Hulse-Taylor binary pulsar B1913+16 has spiralled inward until its orbital period 
is around 0.1 seconds, so that its gravitational waves will be entering the LIGO-Virgo 
sensitive band, its eccentricity will be 10~°. On the other hand, for an “extreme mass-ratio 
inspiral,” in which a stellar-mass compact object is injected from a surrounding star cluster 
into a highly eccentric orbit around a supermassive black hole, there may not be enough 
time for the orbit to circularize, and thus it may be important to include eccentricity. 

We will also assume that the bodies are spinning, but we will assume that the spins are 
either aligned or anti-aligned with the orbital angular momentum. This means that we will 
ignore the precessions of the spins and of the orbital plane that result from spin-orbit and 
spin-spin coupling (see Section 9.1 for discussion). These precessions have the effect of 


293 


12.2 Inspiralling Compact Binaries and Gravitational Waves 


modulating the amplitude and phase of the gravitational waveform in complex ways. There 
has not been sufficient information in the short binary black hole waveforms observed so 
far to detect precession effects, but they will surely be important for binary black hole 
inspirals in the future. Similarly, it is not yet known whether spin precession played a role 
in the binary neutron-star inspiral event. For simplicity, we will ignore such complications, 
and treat only the aligned spin cases. 

We will ignore tidal interactions. For binary black holes, these effects are unimportant 
until the onset of the actual merger of two event horizons. For systems containing neutron 
stars they will also be unimportant until the final few orbits, but then the signature of tidal 
distortion and even disruption of the neutron star as imprinted on the gravitational wave 
signal could carry important information about the equation of state of the matter in the 
neutron star. In order to test alternative theories of gravity, it is desirable to avoid regimes 
where such complex phenomena could occur. 

Finally, we will treat only the part of the inspiral that is governed by the post-Newtonian 
approximation. 

To obtain the relevant conditions for a circular orbit, we take the PPN two-body equation 
of motion (6.70) without the Jz term, choose the PPN parameters for general relativity and 
impose the constraint r = 7 = 0. This yields a condition on the orbital angular velocity w 
given by w* = (m/r?)[1—(3—n)m/r]. Including the spin-orbit and spin-spin contributions 
to the equations of motion, Eqs. (9.4a) and (9.4b), along with 2PN terms in the equations of 
motion (see, for example, Blanchet et al. (1995a)), we obtain 


> mM m m\3/2 m A 
w ac 73 1”) (=) 2 272 +n h-X, 


m\2 41 s 3 : 
ses) {o+ Sata — 50 (xi x2 ~ 3h 2-2) |, (12.36) 


where x, = S,/m?, and h is the unit vector in the direction of the orbital angular 
momentum x x v. Similarly, the energy of the circular orbit is given by 


m lm m\ 3/2 m2 a 

E= i 7 ) 2a h- 
"S| as n+ (= & (2%: +n) Xe 
1 
8 


m 
i 


( yf 49 — 1° a(x: -x2—3h-xaht-x2)} (12.37) 


The rate of loss of energy is given by Eq. (11.109), with spin-orbit and spin-spin effects 
added (Kidder et al., 1993; Kidder, 1995), 
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The orbital angular frequency w = 27f,, where f, is the binary orbit frequency. But for 

quadrupole radiation from a circular orbit, the gravitational-wave frequency F is given 

by F = 2f,. Combining Eqs. (12.36), (12.37), and (12.38) we can obtain an equation for 

the evolution of the gravitational-wave frequency with time as a function of F, given by 
dF /dt = (dE/dt)/(dE/dF), or 

a = 07 (nM)! 1 am (3 de 7") (nmF)*? + (4m — B)(amF) 
34103. 13661 , 59 9 
(= "2016 / * is” 


where ( and o are the spin-orbit and spin-spin contributions, given by 


a) (rey | , (12.39) 


coe > 113% 4759) h 
= DD : m2 + 1) . Xa> 
c= i (247%, x + 72h xh: x2) (12.40) 


Note that (mF)?/ ~ v ~ m/r. If the source is at a sufficiently great distance that 
cosmological redshifts become significant, then the observed frequency Fy; = F/(1 + Z) 
and the observed time interval is dtoy; = (1 + Z)dt. Then Eq. (12.39) applies equally well 
to the observed quantities provided that we define the “observed” or redshifted masses 
Movs = M(1 + Z) and mops = m(1 + Z). 

Combining Eqs. (11.47) and (11.108) for a circular orbit, and substituting r = 
m'/3(2nF)~2/3[1 + O(e)] into the wave amplitude, we can write the response function 
S(t) of a laser interferometer in the general form 


S(t) = “f Olangles)(x MF)?" cos ®(t), (12.41) 


where ®(7) = 26(t) = Inf ‘F(t’)dt’ is the gravitational-wave phase. Integrating 
Eq. (12.39) to obtain F'as a function of ¢, and then integrating to obtain the phase, we obtain 


1 5 (743 11 5 
&(F) = ©, — —(nMF)>" 1 +3 (Te + 7") (rmFy? — 5 (4n — B)(xmF) 


3058673 5429 617 
5 2 4/3 
Ga 1008” * 144” a) (nmF)"" | (12.42) 


where ®, is a constant. Thus we have an accurate prediction (under the chosen 
assumptions) for the gravitational-wave signal at the detector. This is essential for 
confirming a detection and for the measurement of the source parameters (Cutler et al., 
1993), which include distance, position in the sky, orientation of the orbital plane, and 
the masses and spins of the companions. Roughly speaking, the measured signal (which 
includes detector noise) is passed through a linear filter constructed from the theoretical 
signal S(t; @) and the spectral density of the detector noise (Wainstein and Zubakov, 1962). 
The theoretical signal is expressed as a function of an abstract vector 8, which collectively 
represents the source parameters, such as R, tc, M, 7, and so on. The signal-to-noise ratio 
S/N is then computed (see below). The actual values of these parameters 6 are unknown 


prior to the measurement. When 6 = 9, the linear filter becomes the Wiener optimum 
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filter which is well known to yield the largest possible signal-to-noise ratio (Wainstein and 
Zubakov, 1962). A detection can be confirmed and the source parameters determined by 
maximizing S/N over a broad collection of expected signals S(¢;@), loosely referred to 
as “templates.” This method is called “matched filtering”; see Finn and Chernoff (1993), 
Cutler and Flanagan (1994), and Poisson and Will (1995) for the foundations of this 
approach for binary inspiral. 

It has been established that it is the phasing of the signal that plays the largest role in 
parameter estimation. This is because a slight variation in the parameters can quickly cause 
h(t; @) to get out of phase with respect to the true signal A(t; @), thus seriously reducing 
S/N from its maximum possible value. Therefore a good match between the phases of the 
template and the measured signal throughout the observed cycles singles out, to a large 
extent, the value of the source parameters. 

The leading term in ®(F) gives the chirp mass. If there is sufficient sensitivity to measure 
the 1PN correction term, then because of the dependence on 7 and m, one can measure the 
individual masses m,; and mz. Measuring higher-order terms can give information about 
the spins, and can also yield tests of general relativity. 

The key ingredient in matched filtering is the Fourier transform of S(t), given by S (f) = 
(27)! f ee S(t)e?""dt. Using the stationary phase approximation, and confining attention 
to positive frequencies, it is straightforward to show that 


S(f) = Af-/6e , (12.43) 
where A x M*/°Q(angles) /R, and 


= if: 
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where f, is the time corresponding to the phase ®,, and the gravitational-wave frequency F’ 
is to be replaced by the Fourier frequency fin Eq. (12.42). In general relativity, the phase 


w(f) is given by 
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Then, for a detector characterized by a Gaussian noise spectral density S,,(f), the signal- 
to-noise ratio associated with a measurement of a signal S(¢) is given by 


° SP 
S/N= 4/ df. (12.46) 
0 Sn(f) 
Defining the Fisher information matrix by 
Typ =2 / © Sun + SaS% df (12.47) 
ab = > é 
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where subscripts ,a and ,b denote partial derivatives with respect to one of the parameters 
0° characterizing the signal, it can be shown that the error o, in measuring the parameter 
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6“, and the correlation coefficients c“” between parameters 6“ and 0? are obtained from the 
inverse of the Fisher matrix, according to the prescription 
og = ((A6*)’) = (0-')™, 
AG@ Ag? pe! ab 
oc? = ( a (12.48) 


OaPbh Oa9p 


This sensitivity to phase is one of the keys to testing general relativity with gravitational 
waves. In an alternative theory of gravity, the coefficients that appear in Eq. (12.45) may 
differ from those in general relativity, and if those differences depend on a parameter, such 
as the scalar-tensor coupling parameter wo, one can use matched filtering to place a bound 
on such parameters. 


2.2.2 Compact binary inspiral in scalar-tensor theories 


At present, the evolution of the gravitational-wave frequency in scalar-tensor theory cannot 
be written down to the same PN order as that displayed in Eq. (12.39) for general relativity. 
This is because dipole gravitational radiation contributes to the waveform at —0.5PN order 
and to the energy flux at —1PN order, and therefore to obtain terms in dF'/dt at nPN order 
beyond the quadrupole term, one needs to evaluate the scalar wave field and the equations 
of motion to (n + 1)PN order. In addition, the clean expansion in powers of (nmF)?/3 ~ v? 
shown in Eq. (12.39) is no longer so clean when dipole radiation is present. Even though 
the dipole term is formally of leading order, its contribution to the waveform and energy 
flux is proportional to ¢. Since solar-system experiments already place the constraint ¢ < 
10~>, and S is typically less than 0.5, the dipole flux is likely to be small compared to the 
quadrupole flux in the late inspiral phase, where v > 1077, except for situations where 
spontaneous scalarization yields extremely large values of S. Thus a double expansion of 
quantities such as dF /dt and ®©(F) must be carried out, with one parameter being (1mF)?/°, 
the other being the ratio of the two dominant types of flux (see Sennett et al. (2016) for 
discussion). The expansions will look very different depending on whether the evolution 
is dipole-radiation dominated or quadrupole-radiation dominated. 

Here we will keep things simple, and work only to the equivalent of quadrupole order, 
mainly to demonstrate how dipole radiation alters the frequency evolution of the inspiral 
(Will, 1994). From the binary equation of motion (10.69) without spins, we follow the 
method described in Section 12.2.1 to obtain the quasicircular orbit expressions (compare 
Egs. (12.36) and (12.37)) 


uw = on 1 gm (3 n+7 +28, +2A8.)| , 
2. 
ee 1 LET eps +)). (12.49) 
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From Eqs. (11.59), the energy loss rate from a quasicircular orbit is given by 
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where « is given by Eq. (11.121). The final result for the evolution of the gravitational- 
wave frequency is 


dF 96m 2 5/3 5 —2/3 ~ 62 
Fm F (TGMF) [e+ 5g (TGmF)*°CS? | , (12.51) 
where 
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Whereas in general relativity, the post-Newtonian corrections to the leading quadrupole 
frequency evolution are given by a power series in increasing powers of (tGmF)*/?, the 
dipole correction term depends on a negative power that parameter, reflecting its —1PN 
nature. But by measuring the evolution of F in an inspiral, one can place a bound on the 
scalar-tensor parameter ¢. Unfortunately, for the sensitivities of the advanced LIGO-Virgo 
instruments, the bounds are not likely to be competitive with the solar-system bound (Will, 
1994), except possibly for scalar-tensor theories that admit spontaneous scalarization, 
leading to anomalously large values of Si. On the other hand, such bounds would be 
derived from tests of the strong-field dynamical regime, not the weak field, slow motion 
regime of the solar system. Note, however, that for binary black hole inspirals, with 
S| = s. = 1/2, the gravitational-wave signal is identical to that of general relativity, 
after rescaling the masses by the factor 1 — ¢, so no test of basic scalar-tensor theory is 
possible for such sources. One needs a neutron star inspiral, such as the event GW170817, 
to test these theories. 

Sennett et al. (2016) have analyzed the next-order contributions to the frequency and 
phase evolution. 


2.2.3 Compact binary inspiral in other theories 


Compact binary inspiral in other theories of gravity has not been analyzed systematically 
in the same manner as have general relativity and scalar-tensor theories, so only limited 
conclusions about alternative theories can be drawn from the gravitational-wave detections 
made to date (Yunes et al., 2016). On the other hand, a number of phenomenological 
parametrizations have been developed that make it possible to place some preliminary 
bounds on general classes of theories. 


Bounds on the graviton mass and the speed of gravitational waves 


If gravitation is propagated by a massive field, as in massive gravity theories (Section 5.7), 
then the velocity of gravitational waves will depend upon their wavelength as ve = 1 = 
(A/Ag)* [Eq. (11.10)], where Ag = h/mg is the graviton Compton wavelength. In the case 
of inspiralling compact binaries, gravitational waves emitted at low frequency early in the 
inspiral will thus travel slightly more slowly than those emitted at high frequency later, 
resulting in an offset in the relative arrival times at a detector. This modifies the observed 


298 


Strong-Field and Dynamical Tests of Relativistic Gravity 


frequency evolution of the observed inspiral gravitational waveform, similar to that caused 
by post-Newtonian corrections to the quadrupole evolution (Will, 1998). 

To make this quantitative, we consider the propagation of a massive “graviton” in a 
background Friedmann-Robertson-Walker (FRW) homogeneous and isotropic spacetime, 
with the line element 


ds’ = —dt +a’ (t)[dx’ + D°(x)(d6? + sin°6d¢’)], (12.53) 


where a(t) is the scale factor of the universe and X(v) is equal to x, sin y or sinh y if the 
universe is spatially flat, closed or open, respectively. For a graviton moving radially from 
an emitter at y = x. toa receiver at x = 0, itis straightforward to show that the component 


of the graviton’s four-momentum p, = constant. Using the fact that m; = —p"p® gag = 
FE? — iP. where E = p®, together with pX /E = dx/dt, we obtain 
-1/2 
d 1 weg? 
xX _ ieee : (12.54) 
dt a PY 


where p, = a°(te)(E; —m;). Assuming that E, >> mg, expanding Eq. (12.54) to first order 
in (mg/E.)* and integrating, we obtain 


ede TL tie of? 
xe= | a aa a(t)dt. (12.55) 


Consider gravitons emitted at two different times ¢, and ¢/, with energies E, and E’,, and 
received at corresponding arrival times (ye is the same for both). Assuming that At, = 
te — t, < a/a, and noting that mg/E. = (AgF.)~|, where F, is the emitted frequency, we 
obtain, after eliminating x, 


D 1 1 
— 7! an 
t, =t, +(14+Z) |. t, 4 me (2 *)| ; (12.56) 


where Z = ao/a(t,) — 1 is the cosmological redshift, and 


D= CO Paina 


ao 


e 


7 a dZ’ 
Ho Jo (1 42/)?[Qu( +23 + Q4]'/?? 


where ap = a(t,) is the present value of the scale factor. The expression in the second line 
corresponds to a spatially flat universe with matter density parameter (24 and dark-energy 
parameter Q,, with Qy+Q, = 1. Note that D is not a conventional cosmological distance 
measure, like the luminosity distance given by 


(12.57) 


“dt 
Dip =(14+Z : aie 
Bat) a 
14+Z [7 dZ' 
= 12.58 
Ao | [Qu(1 + Z')3 + Qa]!/? 


For a standard ACDM cosmology, D/D, ranges from unity at Z = 0 to 0.3 at Z = 3. 
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Because the arrival time of the signal now depends on frequency, the observed frequency 
evolution is given by 

lola -Z) 

dt dt, dt. dt,’ 


where we have inserted the redshift factor linking F, with F.. We now calculate dt,/dt, 
using Eq. (12.56), and insert the general relativistic quadrupole evolution for dF../dt., 
Eq. (12.39), including the 1PN correction. We are assuming here that, in whatever massive 
gravity theory is being employed, the intrinsic frequency evolution of the system is given 
by the formulae from general relativity, ignoring corrections of fractional order (r/ Ag) 
where + is the size of the binary system. Since solar-system bounds already imply that 
Ag > 10!2 km, these are likely to be small corrections. We then obtain, for the observed 
frequency evolution, 


dF %6r 
dt 5 


(12.59) 


P(t MF)S3 1 7 (= + on + 6.) (nmFY? +. | , (12.60) 


where F’, m, and M are the observed frequency and masses, and where 


_ 96 n?>Dnm 


= = 0 4Z) (12.61) 


The effect of the graviton mass is to modify the coefficient of the 1 PN term in the frequency 
evolution. Thus with sufficiently accurate measurements of the frequency evolution, it 
is possible to measure the individual masses and to place a bound on {g. Early studies 
estimated that advanced LIGO-Virgo detectors could place a bound Ag > 10'? km, while 
the LISA space detector could place a bound A > 10!° km (Will, 1998; Berti et al., 2005; 
Stavridis and Will, 2009; Arun and Will, 2009). 

The dispersion relation E7 = p* + m,, for a massive graviton is Lorentz invariant. 
Mirshekari et al. (2012) considered extensions to theories of gravity that effectively violate 
Lorentz invariance, in which the dispersion relation could take the form 


FE? =p’ +m, + Ap, (12.62) 


where A and a are two parameters characterizing the Lorentz violation. For example, in 
some extra-dimension theories, a = 4 and A is a parameter of the order of the square of 
the Planck length. Mirshekari et al. (2012) studied the bounds that could be placed on mg 
and A for various values of a. 

Tests of the massive graviton hypothesis have now been carried out using data from 
gravitational-wave detections. Data from the discovery event GW150914, led to the bound 
Ag > 1013 km (Abbott et al., 2016c,d). This was improved to 1.6 x 10!3 km by combining 
data from the three events GW150914, GW151226, and GW170104. (Abbott et al., 
2017b). 

This limit is comparable to the solar-system limit, which comes from assuming that 
massive gravity also implies a modification of the Newtonian potential by the inclusion of 
a Yukawa factor, that is, U = (m/r)e~"/*«. Data on Mercury’s perihelion advance and on 
orbits of the outer planets imply that \ > 10!* km (Talmadge et al., 1988), although recent 
improvements in knowledge of Mercury’s orbit described in Section 7.3 probably push 
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that bound above 10!? km. Some have argued for a larger bound on Ag from galactic and 
cluster dynamics (Hiida and Yamaguchi, 1965; Hare, 1973; Goldhaber and Nieto, 1974), 
noting that the evidence of bound clusters and of clear tidal interactions between galaxies 
argues for a range Ag at least as large as a few megaparsecs (6 x 10!° km). However, in 
view of the uncertainties related to the amount of dark matter in the universe, and the fact 
that massive gravity theories frequently include other modifications of gravity on large 
scales, these bounds should be viewed with caution. Even the dynamical arguments used 
to establish these large-scale bounds have been questioned recently (Christodoulou and 
Kazanas, 2017; Mukherjee and Sounda, 2017). Bounds on the Lorentz-violating dispersion 
parameter A were also placed using combined data from GW150914, GW151226, and 
GW 170104 (Abbott et al., 2017b). 

A very strong bound was placed on the difference in speed between light and 
gravitational waves using the binary neutron-star event GW170817 (Abbott et al., 2017a). 
Because the gravitational-wave event could be associated with the gamma-ray burst event 
GRB170817A in the galaxy NGC 4993, the observed time difference between the two 
signals, 1.74 + 0.05 s, combined with the estimate of the distance of 42.9 + 3.2 Mpc, 
produced the bound 


—3x 107% <cg—-1<7x 107". (12.63) 


Bounds on polarizations of gravitational waves 


A restricted test of gravitational-wave polarizations was carried out in 2017, when the 
Virgo detector joined LIGO for several weeks toward the end of LIGO’s second observing 
run. Using data from the binary black hole inspiral GW170814, seen in all three detectors, 
a Bayesian analysis was carried out in which a fit to the data was carried out for pure tensor 
response (i.e., including only F', and F,,, in the response), pure scalar response (only F's and 
F.) and pure vector response (only Fy; and Fy2). The quadrupole response was strongly 
favored over the two alternatives by Bayes factors of 200 and 1000, respectively (Abbott 
et al., 2017c). 


Parametrizations of binary inspiral signals 


Several frameworks have been developed, modeled on the PPpN formalism, in which 
arbitrary parameters are introduced into the predicted gravitational-wave signal from 
binary inspiral in order to encompass alternative theories of gravity. Arun et al. (2006a) 
wrote the Fourier phase «)(f) of Eq. (12.45) in the form 


T 3 ‘A. 
Uf) = 2nft. — be 4+ Lo Tag mn" Wag, (12.64) 
k 


where a, are arbitrary parameters. The general relativistic values of the parameters can be 
read off from the coefficients inside the square brackets of Eq. (12.45) (see also Arun et al. 
(2006b) and Mishra et al. (2010)). 

Yunes and Pretorius (2009) extended this post-Newtonian parametrization to include 
nongeneral relativistic frequency dependences, in the form 
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N-1 
U() = 2mfte— be — 5+ Yi demas”, (12.65) 
k=0 
where ¢, and b, are each a set of N parameters. This parametrization allows for the 
presence of dipole radiation, which begins at order (Mf)~7/7. They also included 
parametrized waveform templates for both the merger and ringdown phase. This is known 
as the parametrized post-Einsteinian (PPE) formalism. 

These parameterizations were extended and adapted for specific use within the LIGO- 
Virgo data analysis pipeline, based on Bayesian model selection (Li et al., 2012; Agathos 
et al., 2014). This “test infrastructure for general relativity” (TIGER) was used in a joint 
analysis of the data from GW150914 and GW151226 to place limits on the various 
parameters of the model (Abbott et al., 2016a). For example, the OPN, 1PN, and 1.5PN 
coefficients in the model were consistent with general relativity to between 10 and 20 
percent; higher PN-order coefficients were more poorly constrained. With more detections, 
these constraints are likely to improve. 


12.3 Exploring Spacetime near Compact Objects 
a ee ae | 


One of the difficulties of testing GR in the strong-field regime is the possibility of con- 
tamination by uncertain or complex physics. In the solar system, weak-field gravitational 
effects can in most cases be measured cleanly and separately from nongravitational effects. 
The remarkable cleanliness of many binary pulsars permits precise measurements of 
gravitational phenomena in a strong-field context. The gravitational waves from inspiraling 
compact binaries detected by LIGO and Virgo appear to be likewise amazingly clean, not 
entirely surprising, since four of the first five detections involved binary black holes. 

Unfortunately, nature is rarely so kind. Still, under suitable conditions, qualitative and 
even quantitative strong-field tests of GR are possible. The combination of improved 
astronomical observations in all wavelength bands, from radio to gamma rays, and better 
theoretical and computational modelling of complex physical processes have offered the 
promise of performing striking tests of strong-field predictions of GR. This is a rapidly 
evolving field, and therefore we will be unable to give a thorough account here. Instead we 
will give a few examples of interesting arenas where possible astrophysical tests of general 
relativity might occur, and will otherwise refer readers to a number of recent reviews 
(Psaltis, 2008; Berti et al., 2015; Johannsen, 2016). 


12.3.1 SgrA*: A black hole in the galactic center 


Lynden-Bell and Rees (1971) suggested that there might be a supermassive black hole in 
the center of the Milky Way, and Balick and Brown (1974) observed a bright, unresolved 
synchroton radio source at the precise galactic center, naming it Sagittarius A*, or SgrA* 
for short. While evidence mounted for the existence of massive black holes in the centers 
of quasars and active galactic nuclei, a black hole in the Milky Way remained only an 


302 


Strong-Field and Dynamical Tests of Relativistic Gravity 


intriguing possibility until advances in infrared interferometry and adaptive optics made 
it possible to detect stars very near the location of SgrA* and to observe their orbital 
motions (Eckart and Genzel, 1996; Ghez et al., 1998). A major breakthrough came with 
the unambiguous determination of the orbit of the star denoted S2 with an orbital period 
of about 16 years, and projected semimajor axis of 0.1 arcseconds (Schédel et al., 2002; 
Ghez et al., 2003). This made it possible, using Kepler’s third law, to determine the mass 
of the black hole directly. Combined with more recent data on S2 and on other stars in the 
central cluster, this has led to an estimate for the black-hole mass of 4.4 + 0.4 x 10°Mo 
and an improved value for the distance to the galactic center of 8.3 + 0.4 kiloparsecs (see 
Genzel et al. (2010) for a thorough review of SgrA*). In addition to opening a window on 
the innermost region of the galactic center, the discovery of these stars has made it possible 
to contemplate using orbital dynamics to probe the curved spacetime of a rotating black 
hole, with the potential to test general relativity in the strong-field regime. 

Soon after the detection of stars orbiting SgrA*, numerous authors pointed out the 
relativistic effects that are potentially observable, including the gravitational redshift and 
time-dilation effects (Zucker et al., 2006) and the pericenter advance (Jaroszynski, 1998; 
Fragile and Mathews, 2000; Rubilar and Eckart, 2001; Weinberg et al., 2005; Kraniotis, 
2007). Although these effects were not detectable with the instrumentation at the time, 
notably the infrared interferometer at the Keck Observatory in Hawaii and the Very Large 
Telescope infrared Interferometer (VLTI) at the European Southern Observatory, the next 
generation of the instrumentation may bring relativistic tests within reach. The upgrade of 
the VLTI, known as GRAVITY, made its first observations of the galactic center in June 
2016 (GRAVITY Collaboration et al., 2017). The Thirty Meter Telescope (TMT), currently 
under development, will also have the capability to make such measurements. 

For example, the gravitational redshift of spectra of the stars S2 and S102 will be 
detectable during their next pericenter passages in 2018 and 2021, respectively. The 
pericenter precession of S2 is another target for detection in 2018. In general relativity, 
the leading contribution to the pericenter advance rate is given by wW = 67m/Pya(1 — e”). 
However, if the precession of the orbit is being measured astrometrically from Earth, then 
we must project the orbit onto the plane of the sky and determine the rate of change of the 
relevant point on the orbit as seen from Earth. Thus the projected pericenter shift is given 
by Wproj = [a(1 + e)/R]w cosu where the plus (minus) sign corresponds to measuring the 
shift at apocenter (pericenter), 4 is the inclination of the orbit, and R is the distance to the 
galactic center. Using the mass and distance of SgrA* quoted earlier, the result is 


1 
Wproy = 98.3 pias yr! (3) fee (12.66) 


where the minus sign now corresponds to a measurement at apocenter. For S2 and S102, 
the rates are 37 and 23 microarcseconds per year. Current observations of S2 and S102 are 
within a factor of 10 of measuring the precession (Hees et al., 2017). 

Among the goals of second-generation projects such as GRAVITY and the TMT are 
astrometric precisions of tens of microarcseconds per year, and sensitivities enabling the 
detection of stars orbiting much closer to the black hole than the S-stars (if such stars exist). 
This makes it possible to consider doing more than merely detect relativistic effects, but 
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Table 12.3 Orbital parameters of selected stars orbiting the galactic center black hole SgrA*. Data taken 
from Gillessen et al. (2009) and Meyer et al. (2012). 


Star a (a.u) e u(°) Py (yr) 

$2 1020 + 8 0.880 + 0.003 135.25 + 0.47 15.8 +0.1 
S4 2470 + 160 0.406 + 0.022 Hilgs3) ae 3 Bes) as PAK0) 
$5 2080 + 350 0.842 + 0.017 143.7 + 4.7 45.7 + 6.9 
S9 2430 + 430 0.825 + 0.020 81.0 + 0.7 58.0 + 9.5 
S14 2125 + 83 0.963 + 0.006 99.4 + 1.0 47.3 + 2.9 
$38 1160 + 340 0.802 + 0.041 166 + 22 18.9+5.8 
$102 SE, 0.68 + 0.02 alae 3} ILS) se 0.3) 


rather to provide the first test of the black hole “no-hair” or uniqueness theorems of general 
relativity (Will, 2008). According to those theorems, an electrically neutral black hole is 
completely characterized by its mass m and angular momentum J. As a consequence, all 
the multipole moments of its external spacetime are functions of m and J. Specifically, the 
quadrupole moment 0, = —J? /m. 

To see how such a test might be carried out, we work in the post-Newtonian limit, and 
write down the equation of motion for a test body in the field of a body with mass m, 
angular momentum J, and quadrupole moment Q>. This can be obtained from Eq. (6.70), 
using general relativistic values of the PPN parameters, setting 7 = 0, and including the fact 
that the dimensionless quadrupole moment J2 and Q» are related by mR2Jy = —Q>. We 
also include the spin-orbit terms from Eq. (9.4a), with y = 1, a; = 0, and S$; = m, = 0, 
hence o = 0. The result is 
dv mn > 4m 4mvr 

i il = 


2 


r 
- * [20 xe—3inxe—3r 'n(h-e)| 
= os [Sn(n-e)? —2e(n-e) —n], (12.67) 


where v is the velocity of the body, n = x/r,h = x x v, and e = J/J is a unit vector 
along the symmetry axis of the black hole. If we define a dimensionless spin parameter y 
by y = J/m?’, then, for a Kerr black hole, 0 < x < 1 and QO) = —M?y’. 

We consider a body with orbital elements a, e, 4, 2 and w defined using the plane of the 
sky as the X—Y plane of the reference system (see Figure 6.1). The axis of the black hole 
has an unknown orientation relative to the reference system. Substituting the perturbing 
acceleration of Eq. (12.67) into the Lagrange planetary equations (6.74) and integrating 
over an orbit, we obtain the changes Ae = Aa = 0, and 


1 
Aw = As — 24;cos a — x40(1 —3cosa), 


AQ = a ae ee 
sine 


Ai = sinacos B(A;— Agcosa), (12.68) 
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where 
As = 6nm/p, 
Ay =4nJ/(mp*)'/? , 
Ag = 3nQ>/mp’, (12.69) 


where Aw = Aw + cosz AQ, and p = a(1 —e”). The angles a and £ are the polar angles 
of the black hole’s symmetry axis e with respect to the star’s orbital plane defined by the 
line of nodes eg, and the vector in the orbital plane e, orthogonal to eg and h, according 
to cosa =e-h, sinacos 6 =e-egand sinasinG=e-e,. 

The structure of the expressions for AQ and Av can be understood as follows: 
Eq. (12.67) implies that the orbital angular momentum h precesses according to dh/dt = 
wp x h, where the orbit-averaged precession vector is given by wp = e(Ay — Ag, cosa). 
The orbit element variations are given by di/dt = wp - eg and sinudQ/dt = wp-e,. As 
a consequence, we have the purely geometric relationship, 


sinedQ/dt _ 

di/dt 

To get rough idea of the astrometric size of these precessions, we define an angular 

precession rate amplitude 9, = (a/R)A,/Pp, where R is the distance to the galactic center 

and Py is the orbital period. Using m = 4.4 x 10° Mo, D = 8.3 kpc, we obtain the rates, in 
microarcseconds per year, 


tan 3. (12.70) 


Os ¥ 98.3P '(1—e)7!, 
O, = 1.07xP,*F(1 — e)3/?, 
Og © 13 x 1077 2P, (1 — e)-?, (291) 


where P, is measured in years. The observable precessions will be reduced somewhat 
from these raw rates because the orbit itself must be projected onto the plane of the sky. 
As we noted earlier, the contribution to Aq is reduced by cos, and the contribution to 
Az is reduced by sinv. For an orbit in the plane of the sky, changes in the inclination 
are unmeasurable, and changes in the nodal angle become degenerate with changes in the 
pericenter. 

For the quadrupole precessions to be observable, it is clear that the black hole must have 
a decent angular momentum (y > 0.5) and that the star must be in a short-period high- 
eccentricity orbit. For example, for x = 0.7, Ph = 0.1 yr, and e = 0.9, the three amplitudes 
listed in Eq. (12.71) have the values 5200, 195 and 8 pas per year, respectively. 

Although the pericenter advance is the largest relativistic orbital effect, it is not the most 
suitable effect for testing the no-hair theorems. The frame-dragging and quadrupole effects 
are small corrections of the leading Schwarzschild pericenter precession, and thus one 
would need to know m, a and e to sufficient accuracy to be able to subtract that dominant 
term to reveal the smaller effects of interest. Furthermore, the pericenter advance is affected 
by a number of complicating phenomena, including 2PN effects, the disturbing effects of 
surrounding mass in the form of gas, stars or dark matter, and the effects of tidal distortion 
of the star near its pericenter. 
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On the other hand, the precessions of the node and inclination are relatively immune 
from such effects. Any spherically symmetric distribution of mass has no effect on 
these orbit elements. As long as any tidal distortion of the star is quasi-equilibrium with 
negligible tidal lag, the resulting perturbing forces are purely radial, and thus have no 
effect on the node or inclination. On the other hand, even if a surrounding cluster of stars 
is spherical on average, the “graininess” of the perturbing forces from a finite number of 
such stars will have an effect on the orbital plane of a chosen target star. These perturbing 
effects were studied for a range of hypothetical distributions of cluster stars by Merritt et al. 
(2010) and Sadeghian and Will (2011) 

In order to test the no-hair theorem using the precessions AQ and Av, we need to 
measure four quantities, the magnitudes J and Q2, and the angles a and ({. Thus it will 
be necessary to measure AQ and Av for at least two stars whose orbital planes are 
appropriately nondegenerate. Even if the precision is not sufficient to be sensitive to the 
small quadrupole effect, it would still be possible to measure the spin magnitude and 
direction of the black hole, which would shed light on how it evolved and grew during 
the lifetime of the galaxy. 

Observing stars is not the only way to explore the spacetime near SgrA*. If a pulsar were 
observed orbiting sufficiently close to the black hole, observations of its orbital precession 
using pulsar timing rather than astrometry could also contribute to a test of the no-hair 
theorem (Wex and Kopeikin, 1999; Liu et al., 2012). In addition to stars, there are a 
number of gaseous disks orbiting the central black hole (Genzel et al., 2010). Working at 
submillimeter wavelengths and linking a worldwide set of radio telescopes to form a giant 
VLBI array, a collaboration known as the Event Horizon Telescope (EHT) is approaching 
the capability of imaging SgrA* with event-horizon-scale angular resolution (Doeleman 
et al., 2009). Observation of accretion phenomena in the innermost disk at these angular 
resolutions could provide tests of the spacetime geometry very close to the black hole. 
(Johannsen et al., 2016a,b). And combining data from stellar astrometry, pulsar timing and 
EHT could have significant advantages for testing the no-hair theorem (Psaltis et al., 2016). 

The observations needed to explore the strong-field region around SgrA* are very 
challenging, but steady progress is being made, notably with GRAVITY and the EHT. In 
addition, some luck will be called for: it is not known whether a population of sufficiently 
bright young stars exists close enough to the black hole to make no-hair tests feasible. Nor 
is it known if suitable pulsars exist sufficiently close to SgrA*, although the discovery of a 
magnetar (a pulsar with an extremely large magnetic field) in the neigborhood of the black 
hole gives reason for hope. 


12.3.2 Neutron stars and black holes 


Neutron stars and stellar-mass black holes may also be important arenas for testing strong- 
field gravity. 

Studies of certain kinds of accretion known as advection-dominated accretion flow 
(ADAF) in low-luminosity binary X-ray sources have yielded hints of the signature of the 
black hole event horizon (Narayan and McClintock, 2008). The spectrum of frequencies 
of quasi-periodic oscillations (QPO) from accretion onto black holes and neutron stars in 
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binaries may permit measurement of the spins of the compact objects. Aspects of strong- 
field gravity and frame-dragging may be revealed in spectral shapes of iron fluorescence 
lines and continuum emission from the inner regions of accretion disks (Reynolds, 2013; 
Miller and Miller, 2015). See Abramowicz and Fragile (2013) for a review of accretion 
onto black holes and neutron stars. 

The structure of neutron stars depends strongly on both the equation of state of nuclear 
matter and the theory of gravity, and there is considerable degeneracy between these two 
ingredients in such parameters as the mass and radius of the neutron star. However, the 
discovery of the “I-Love-Q” phenomenon, a relation between the moment of inertia, 
the Love number and the quadrupole moment of rotating neutron star models that is 
remarkably insensitive to the equation of state (Yagi and Yunes, 2013), may break the 
degeneracy and open up ways to test alternative theories of gravity. 

For detailed reviews of strong-field tests of GR involving neutron stars and black holes 
using electromagnetic observations, see Psaltis (2008) and Johannsen (2016). 


12.4 Cosmological Tests 
SSS Sa BS BEE EE EES SS 


From a few seconds after the Big Bang until the present, the underlying physics of 
the universe is well understood, in terms of a standard model of a nearly spatially flat 
universe, 13.6 billion years old, dominated by cold dark matter and dark energy, called 
the ACDM model. Notwithstanding some observational tensions, such as slightly differing 
values of the Hubble parameter coming from different observational techniques, difficulties 
accounting for the distribution of structures over all galactic scales, and the failure to date 
to detect the fundamental particle that is presumed to constitute dark matter, the general 
relativistic ACDM model agrees remarkably well with a wide range of observations. 

Other theories, such as Brans-Dicke theory, are sufficiently close to general relativity 
(for large enough wo) that they conform to all cosmological observations, within the 
uncertainties. Certain generalized scalar-tensor theories, however, could have small values 
of w at early times making aspects of early universe evolution highly non general 
relativistic, while evolving to large w today, thereby agreeing with all solar system and 
astrophysical experiments, and hewing closely to late-time ACDM cosmology (Damour 
and Nordtvedt, 1993a,b). 

One way to test such theories is through Big-Bang nucleosynthesis (BBN), since the 
abundances of the light elements produced when the temperature of the universe was about 
1 MeV are sensitive to the rate of expansion at that epoch, which in turn depends on the 
field equations of the theory. Because the universe is radiation-dominated at that epoch, 
uncertainties in the amount of cold dark matter or dark energy (A) are unimportant. The 
nuclear reaction rates are reasonably well understood from laboratory experiments and 
theory, and the number of light neutrino families (3) conforms with evidence from particle 
accelerators. Thus, within modest uncertainties, one can assess the quantitative difference 
between the BBN predictions of general relativity and other theories of gravity under 
strong-field conditions and compare with observations. For recent analyses, primarily 
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within scalar-tensor theories see Santiago et al. (1997), Damour and Pichon (1999), Clifton 
et al. (2005), and Coc et al. (2006). 

A different class of theories has been developed in part to provide an alternative to the 
dark energy of the standard ACDM model, by modifying gravity on large, cosmological 
scales, while preserving the conventional solar and stellar-system phenomenology of 
general relativity. The f(R) theories are examples. Since we are now in a period of what 
may be called “precision cosmology,” one can begin to test alternative theories in this 
class using the accumulation of data on many fronts, including the growth of large-scale 
structure, cosmic background radiation fluctuations, galactic rotation curves, BBN, weak 
lensing, baryon acoustic oscillations, and so on. 

Apart from direct measurements of the expansion rate of the universe through the 
observations of standard candles such as Type II supernovae, most information about 
the universe comes from studying deviations from a homogeneous, isotropic Friedmann- 
Robertson-Walker background spacetime, expressed roughly in the form 


2 
ds? = —dt’ +a(t)’ ; = +7 (de + sin?) + Ayydxtdx” , (12.72) 
where ¢ is proper time as measured by a clock at rest, a(t) is the scale factor, here with units 
of distance, r, 6, ¢, and x" are dimensionless coordinates, and k = 0 or +1 determines 
the curvature of the spatial sections. Unlike the PPpN formalism, where the background 
spacetime was flat and it was relatively simple to characterize the deviations represented by 
hy», in the cosmological case, there are many different approaches, with a range of possible 
gauge choices. In addition, the evolution of the background will itself depend on the theory 
of gravity being studied. Finally the split between “background” and “perturbation” may 
not be as clean as it was in the post-Newtonian case. 

This has led to a rich variety of approaches to treating cosmological tests of gravitational 
theories, including Amin et al. (2008), Daniel et al. (2010), Dossett et al. (2011), Dossett 
and Ishak (2012), Zuntz et al. (2012), Hojjati et al. (2012), Baker et al. (2013), and Sanghai 
and Clifton (2017). For a comprehensive review, see Ishak (2018). This is another rapidly 
evolving field, and further details are beyond the scope of this book. 


